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1.  INTRODUCTION 

When  some  of  Che  quantities  involved  in  the  mathematical  model  that 
describes  a  particular  filtering  situation  are  not  completely  known, 
another  model  incorporating  the  available  partial  knowledge  must  be 
adopted.  This  partial  information  can  be,  for  example,  in  the  form  of 
a  prior  distribution  function  if  considering  the  unknown  quantity  as  a 
random  variable,  or  in  the  form  of  an  uncertainty  set,  any  of  the  members 
of  which  can  be  the  actual  quantity.  Then  the  objective  could  be,  in  the 
first  case,  to  maximize  the  expected  value  of  the  original  performance 
measure  or,  in  the  second  case,  to  maximize  the  performance  of  the  optimum 
operating  point  over  the  uncertainty  set. 

Focusing  our  attention  in  the  case  in  which  the  unknown  quantity  is 
modeled  by  an  uncertainty  set,  suppose  that  our  aim  is  to  synthesize  a 
system  that  is  somehow  insensitive  (KObuAt)  to  the  member  of  the  uncertainty 
set  that  is  actually  present.  One  of  the  possible  strategies  to  achieve 
this  is  to  optimize  the  worst-case  performance  (m-mx/nax) .  In  such  case, 
for  any  other  system  there  exists  an  operating  point  in  the  uncertainty  set 
for  which  it  performs  worse  than  the  minimax  system.  This  can  be  considered 
as  a  game  in  which  the  designer  tries  to  guarantee  a  maximum  payoff  assuming 
that  his  opponent  (nature)  will  select  the  worst  possible  operating  point 
for  whatever  system  he  uses. 

Because  the  less  uncertain  the  mathematical  model  is  the  better  the 
resulting  system  will  be,  an  alternative  that  the  engineer  may  consider  is 
that  of  an  adaptive  system;  i.e.,  one  consisting  of  a  subsystem  that  narrows 
down  the  uncertainty  by  learning  a  more  accurate  model  of  the  unknown  system 
from  the  incoming  observations.  This  is  especially  necessary  when  the 


uncertainties  are  so  large  that  trivial  and  robust  filtering  perform 
similarly.  Indeed  the  adaptive  and  minimax  philosophies  constitute  the 
mainstreams  of  the  design  of  uncertain  systems.  The  mutual  advantages 
and  disadvantages  of  both  approaches  have  been  widely  discussed.  To  name 
a  few,  adaptive  systems  are  supposed  to  be  more  complex,  to  have  worse 
performance  for  small  sample  size  problems,  and  thus  inferior  dynamical 
response,  and  to  be,  commonly,  ad-hoc  solutions;  minimax  systems  are 
criticized  as  being  pessimistic  (although  often  this  is  not  the  least 
advisable  engineering  approach),  as  losing  performance  in  some  nominal 
(and  maybe  most  probable)  model,  and  as  being  dependent  on  the  specifi¬ 
cation  of  an  often  arbitrary  uncertainty  class.  Nevertheless  these  two 
philosophies  are  by  no  means  mutually  exclusive.  Since  the  identification 
stage  of  an  adaptive  system  provides  a  model  with  some  uncertainty  (com¬ 
monly  approaching  zero  asymptotically)  it  is  conjectured  that  a  minimax 
processing  system  could  aid  in  improving  the  dynamics  and  region  of  con¬ 
vergence  of  the  overall  adaptive  system. 

A  terminological  comment  is  in  order  here.  Zadeh  [33]  identifies 
adaptivity  with  robustness,  stating  that  a  system  is  adaptive  with  respect 
to  an  uncertainty  class  if  it  performs  acceptably  well  for  all  its  members. 
The  term  robustness  as  referred  to  insensitivity  to  deviations  in  the  model 
has  an  established  usage  in  ~he  areas  of  statistics  and  control  systems, 
and  has  been  used  with  a  minimax  connotation  in  signal  filtering  problems. 

The  application  of  the  minimax  philosophy  of  game  theory  to  filtering 
problems  in  communication  can  be  traced  back  to  the  works  of  Yovits  and 
Jackson  [35 j  and  Nilsson  [36]  ,  dealing  with  signal  estimation  and  matched 
filtering.  However,  the  statistical  works  of  Huber  in  estimation  [30]  and 
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hypothesis  testing  [31]  can  be  considered  as  the  starting  point  of  the 
area  of  minimax  robustness,  successfully  applied  to  a  long  sequel  of 
problems  in  statistical  communication  theory  (surveyed  in  [34 1 , [38] »[39] ) . 
Basically,  minimax  filtering  has  been  applied  to  three  classes  of  linear 
filtering  problems: 

i)  Wiener  filtering:  [26  ]-[28  ] ,  [  35] , [40],[ 41] ,[ 43] ,[ 46],[ 47] 

ii)  Matched  filtering:  [  3] ,[5] ,[  36] ,[37] ,[43]-[45] ,[48  ] 

iii)  Kalman  filtering:  [  10  ]—  [  14 ] ,  ( 19 ]»[A9  ] 

In  this  thesis  we  present  in  the  first  part  several  new  general  results 
(not  specific  to  any  particular  filtering  situation)  and  in  the  second 
part  their  application  to  various  problems  in  linear  and  non-linear  fil¬ 
tering. 

The  cases  in  minimax  robust  filtering  for  which  there  exists  an 
amenable  analytical  solution  are  those  for  which  iaddtz  potnti  exist.  A 
filter  H  and  an  operating  point  P  are  said  to  form  a  saddle  point  if, 
fixing  P,  any  other  filter  different  from  H  has  worse  performance,  i.e. 

H  i6  the  optimal  filter  for  P,  and  if,  fixing  H,  any  other  operating 
point  different  from  P  gives  better  performance,  I.e.  H  has  its  worst 
performance  when  P  is  present.  If  there  exists  such  a  filter  H,  then  it 
is  the  sought-after  minimax  robust  filter,  because  its  worst-case  perfor¬ 
mance  is  attained  at  P  and  any  other  filter  has  worse  behavior  at  P. 
Further,  suppose  that  we  use  optimal  filters  for  every  operating  point  in 
the  uncertainty  class,  then  P  is  the  element  whose  filter  achieves  the 
worst  optimal  performance,  and  hence  will  be  referred  to  as  the  tza.it 
^avoaablz  operating  point.  Note  that  the  saddle  point  property  is  not 
necessary  for  the  robust  filter  to  exist;  however,  if  it  holds,  the  robust 
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filter  has  the  convenient  feature  of  being  the  optimal  filter  for  one  of 
the  operating  points  (the  least  favorable)  and  of  performing  better  at  any 
other  point.  In  this  case  the  minimax  robust  filtering  problem  is  reduced 
to  the  search  for  a  least  favorable,  since  we  usually  suppose  that  the 
derivation  of  the  optimal  filter  for  a  particular  operating  point  is  given 
by  the  classical  theory.  Frequently,  there  exist  analytical  expressions 
for  the  optimal  performance  achievable  at  every  operating  point;  therefore 
the  search  for  the  least  favorable  turns  out  to  be  a  minimization  problem. 

At  this  point,  the  next  step  is  to  ask  for  conditions  under  which  a  saddle 
point  exists.  This  is  the  question  addressed  by  the  minimax  theorems  of 
game  theory  (see  e.g.  [32]);  a  common  assumption  in  these  is  the  concavity- 
convexity  of  the  performance  function  in  the  set  of  filters  and  in  the  set 
of  operating  points,  respectively.  However  it  is  often  the  case  that  the 
filtering  situations  that  we  face  do  not  satisfy  this  assumption.  In  the 
robust  filtering  theorem  of  Chapter  2,  sufficient  conditions  are  provided 
under  which  the  robust  filter  is  the  optimal  filter  for  the  least  favorable 
operating  point.  Roughly,  these  conditions  are  the  convexity  of  the  uncer¬ 
tainty  set,  a  continuity-type  condition  on  the  performance  function,  and 
the  convexity  of  this  function  in  the  set  of  operating  points.  However  this 
by  itself  does  not  assure  the  existence  of  a  saddle  point  since  the  uncer¬ 
tainty  class  may  not  have  a  least  favorable  element;  in  this  respect  the 
robust  filtering  theorem  cannot  be  considered  a  minimax  theorem  since  it 
leaves  open  for  every  particular  uncertainty  set  the  existence  of  least 
favorable  operating  points  (actually,  the  condition  of  requiring  uncertainty 
classes  with  least  favorables  is  relaxed,  and  more  general  classes  are 
allowed).  The  next  general  result  presented  in  Chapter  2  deals  with  the 


frequent  situation  in  which  there  are  several  independent  uncertainties; 
under  some  restrictions  on  the  performance  function  necessary  and  suffi¬ 
cient  conditions  are  given  for  a  filter  and  operating  point  pair  to  be  a 
saddle  point.  These  conditions,  which  appear  in  recursive  form  (and  do 
not  require  convexity  of  the  uncertainty  classes) ,  are  used  successfully 
in  solving  particular  problems  in  the  following  chapters,  and  turn  out  to 
be  very  useful  in  testing  candidates  for  saddle  points.  Another  idea  that 
is  investigated  in  Chapter  2  is  the  soft  minimax  philosophy  that  can  be 
employed  when  the  modeling  uncertainty  is  diminished  by  the  presence  of  a 
nominal,  most  likely,  element. 

In  Chapter  3,  the  general  robust  matched  filtering  problem  is  solved 
using  the  aforementioned  results;  in  particular  the  uncertainties  separation 
theorem  results  in  a  set  of  simpler  and  more  general  conditions  than  those 
previously  published.  Also,  least  favorable  signals  and  noise  for  several 
useful  uncertainty  classes  are  obtained  in  the  discrete-time  case,  and  the 
optimal  design  of  signals  to  combat  their  uncertainty  in  reception  is  inves¬ 
tigated  as  well. 

The  problem  of  designing  linear  observers  and  regulators  for  stochastic 
systems  in  both  continuous  and  discrete-time  with  uncertain  second  order 
statistics  is  addressed  in  Chapter  4.  Once  again,  in  this  case  the  appli¬ 
cation  of  the  general  results  allows  the  conjoint  resolution  of  the  estima¬ 
tion  and  control  problems.  The  soft  minimax  solution  of  the  unknown  first- 
order  statistics  case  is  shown  to  be  equivalent  to  earlier  works  in  the 
tracking-evasion  games.  Particular  uncertainty  classes  of  covariance  matrices 
are  studied  and  some  technical  errors  in  a  treatment  of  minimax  Kalman 
filtering  which  appeared  in  the  literature  are  pointed  out. 
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In  Chapter  5,  several  further  applications  are  considered.  Some 
general  results  for  Wiener  filtering  are  provided  and  a  critique  of  the 
usually  employed  penalty  function  is  presented.  Next  the  robust  filtering 
theorem  and  the  uncertainties  separation  theorem  are  applied  to  two  problems 
in  random-signal  detection,  namely  the  quadratic  receiver  and  the  output- 
energy  filter.  Finally,  a  non-filtering  application  is  demonstrated  by 
providing  a  minimax  result  for  hypothesis  testing  and  an  alternative  proof 


of  the  classical  minimax  robust  location  estimation  theorem. 
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2 .  GENERAL  RESULTS 

2.1  Formulation 

In  an  unconstrained  sense,  given  some  external  situation  (operating 
point) ,  and  a  processing  device  (filter),  a  quantitative  measure  of  per¬ 
formance  is  assigned.  The  classical  filtering  problem  is  to  find  a 
filter  that  maximizes  that  figure  of  merit  for  a  particular  operating 
point.  As  discussed  in  the  previous  chapter,  when  the  operating  point 
is  not  exactly  known,  one  of  the  alternatives  is  to  search  for  a  (maximin) 
filter  with  an  optimum  lower  bound  of  performance.  The  goal  of  this  chapter 
is  to  provide  some  general  tools  that  can  be  applied  in  a  wide  variety  of 
these  filtering  situations  with  uncertain  operating  points.  Denote  by  3C 
the  space  of  filters  and  by  <2  the  space  of  operating  points.  The  payoff 
function  M  is  a  real  functional 

M(  • ,  •)  :  K  *  0  -*■  IR 

The  triple  (3f, Q.* M)  defines  the  type  of  filtering  situation.  Suppose  H  C  3f 
and  Q  C  0  are  the  sets  of  allowable  filters  and  operating  points  respectively. 
According  to  the  standard  terminology  the  triple  (H,Q,M)  will  be  referred  to 
as  a  game.  The  following  definitions  will  be  used: 

M  (q)  *  sup  M(h,q);  (2.1) 

h  €  H 


M(h  (q) ,q)  =  M*(q) ;  (2.2) 

q^  is  a  least  favorable  operating  point  for  (H,Q,M)  if 
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q  *  arg  min  M  (q) 

q  6  Q 


(2.3) 


(Note  that  throughout  this  thesis  we  use  the  notation  x.  ■  arg  max/min  f(x) 

0  x  €  A 

if  f(xft)  *  max/min  f(x),  without  implying  neither  existence  nor  uniqueness.) 
x  €  A 

(hL»qL)  6  H  x  Q  is  a  regular  pair  for  (H,Q,M)  if  for  every  q  6  Q  such 
that  x  *  (l-a)q  +  aq  €  Q  for  all  0  <  a  <  1, 

Cl  L* 


M  (x  )  -  M(h  ,x  ) 

llm  - 5 - k_2_  ,  o 

a  +  0  a 


(2.4) 


(h^,qL)  €  H  x  q  is  a  saddle  point  solution  to  the  game  (H,Q,M)  if  for  every 


h  €  H  and  q  €  q 


M(h,qL)  <  M(hL,qL)  <  M(hL,q) 


(2.5) 


hR  is  a  maximin  robust  filter  for  the  game  (H,Q,M)  if 


h  *  arg  max  inf  M(h,q) 
h  €  H  q  €  Q 


(2.6) 


2.2  Robust  filtering  theorem 

In  this  section  we  give  some  general  results  pertaining  to  the  solution 
of  the  problem  described  by  (2.6). 


Lemma  2.1 

Let  f:  [0,11  +  1  be  a  convex  function.  Then  f(0)  <  f(a)  fa  €  [0,1]  if 

and  only  if  lim  —  [f(a)-f(0))  exists,  is  finite  and  nonnegative, 
a  i  0  01 


Proof 


This  is  an  extension  of  the  proof  given  in  [11  for  an  open  interval. 


9 


I 

I 

I 

\ 

I 

I 

I 

I 

1 

1 

1 


First  we  show  that  J  [f(x)  -  f(0)]  is  monotone  non-decreasing  in  x  6  (0,1]  . 
Suppose  0  <  x  <  x' ,  because  f(*)  is  convex 

f(x)  <  f(x')  •  Jr  +  f (0)  - 

or  equivalently 

-  tf(x)  -  f (0)]  <  Jr  [f(x*>  -  f (0)1 
Now  we  can  prove  the  only  if  part : 

Suppose  f(0)  <  f (a)  Va  €  [0,1]  ,  then  Va  €  (0,1]  ,  -  [f(a)  -  f (0)]  >  0, 

a 

but  since  this  function  is  monotone  non-decreasing  lim  —  [f(oc)  -  f (0)] 

«  ot 
ot  +  0 

exists,  is  finite  and  nonnegative. 

For  the  reverse  implication,  assume  that  lim  —  [f(a)  -  f(0)]  exists, 

a  +  0  a 

is  finite  and  nonnegative,  then  for  every  x  €  (0,1] ,  3z<*  such  that 
Y  [f (z)  -  f (0) ]  >  0,  but  since  this  function  is  monotone  non-decreasing  in 
2,  J  lf(x)  "  f (0 ) ]  >  0,  hence  f(x)  >  f(0). 

Lemma  2.2 

Suppose  Q  is  a  convex  set,  and  for  every  h  €  h,  M(h,0  is  convex  in  Q. 
With  respect  to  the  game  (H,Q,M),  a  regular  pair  (h  ,q  )  is  a  saddle  point 

la  la 

solution  if  and  only  if  q^  is  least  favorable. 

Proof 

The  left  inequality  in  the  definition  of  saddle  point  (2.5)  is  satis¬ 
fied  for  every  regular  pair,  because  particularizing  the  regularity  condition 
(2.4)  for  q  ■  qL»  it  follows  that 

M*(qL)  -  M(hL,qL) 


(2.7) 
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According  to  Lemma  2.1,  the  right  inequality  is  satisfied  if  and  only  if 


M(h  ,x  )  -  M(h  ,q  ) 

A(q)  -  lim  - -  -  - - >  0 

a  +  0 


(2.8) 


for  every  q  €  Q. 

Now,  note  that  M*(0  is  convex  in  Q  since  with  q^  *  (l-a)q^  +  au^,  for 
all  0  <  a  <  1,  and  q, ,q9  €  Q,  we  have 


sup  M(h,qn)  <  sup  { (1  -  a)M(h,q1 )  +aM(h,q?)) 
h€H  hSH 

*  (1  -  a)  sup  H(h,q1)  +  a  sup  M(h,q„) 
h  6  H  h  6  H 


(2.9) 


where  the  inequality  follows  from  the  assumed  convexity  of  M  in  Q  for  every 
h  e  H. 

Again  by  Lemma  2.1,  since  M  (•)  is  convex  in  Q,  q^  is  a  least  favorable 
operating  point  if  and  only  if 


B(q)  ■  lim 
a  +  0 


M*(x  )  -  M*(q.) 


(2.10) 


for  every  q  €  Q. 

Considering  that 


M*(xa)  -  M*(qL) 


M  (x  )  -  M(hT , x  )  +  M(h.  ,x  )  -  M(h  ,q  )  (2.11) 

Ql  L  Cl  L  Ct  L  Li 


and  that  (q  ,h. )  is  a  regular  pair,  taking  lim  j-  [  ]  of  both  sides  of 

L  L  a  +  0 


2.11,  we  get  that  for  any  q  e  Q 


A(q)  -  B(q)  , 


(2.12) 


and  therefore  the  lemma  is  proved. 
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Theorem  2.1 

Suppose  chat  (K,P,M)  is  a  game  with  the  following  properties: 

1)  There  exists  a  convex  set  Q,  P  C  q  C  Q  such  that 

Inf  sup  M(h,q)  ■  inf  sup  M(h,q)  (2.13) 

q€Ph£K  q€Qh€K 

11)  There  exists  a  set  H,  such  that  H  C  K  C  }f  and  for  every  q  €  Q 

sup  M(h,q)  »  sup  M(h , q) 
h  €  K  h  €  H 

ill)  M(h,*)  is  convex  in  Q  for  all  h  €  H. 

Then,  if  a  least  favorable  for  (H,Q,M)  qL»  and  its  optimal  filter  hL 
form  a  regular  pair  for  (H,Q,M) ,  h^  is  a  maximin  robust  filter  for  the  game 
(K,P,M). 

Proof 

According  to  Lemma  2.2,  if  a  least  favorable  for  (H,Q,M),  q^,  and  its 
optimal  filter,  h^,  form  a  regular  pair,  (h^.q^)  is  a  saddle  point  solution 
to  the  game  (H,Q,M).  Therefore,  we  have 

M(hL,qL)  *  inf  M(hL,q) 

q  €  Q 

<  inf  M(h  ,q) 
q  e  P 

<  sup  inf  M(h,q)  (2.14) 

h  S  K  q  €  P 

where  the  first  inequality  follows  from  P  C  Q  and  the  second  from  6  H  C  k. 


Furthermore, 


M(h  ,q  )  -  sup  M(h,q  ) 

L  h  £  H  L 

>  inf  sup  M(h,q) 

q  €  Q  h  £  H 

■  inf  sup  M(h,q) 

q  £  P  h  £  K 


(2.15) 


where  Che  inequality  follows  from  q^  £  Q  and  Che  last  equality  from  assump¬ 
tions  i)  and  ii) .  But  for  an  arbitrary  game  we  have  (e.g.  [2]) 


sup  inf  M(h,q)  <  inf  sup  M(h,q) 
h  £  K  q  £  P  q€Ph€K 


(2.16) 


Therefore  the  previous  inequalities  in  (2.14)  and  (2.15)  are  transformed  into 
equalities.  In  particular. 


sup  inf  M(h,q)  ■  inf  M(hT ,q) 
h  €  K  q  €  P  q  £  P 


(2.17) 


and  therefore  h^  is  a  maximin  robust  filter  for  (K,P,M),  as  we  wanted  to  show. 


Remarks 

The  utility  of  this  result  stems  from  the  fact  that  usually  explicit 
* 

expressions  for  M  (•)  are  available,  and  therefore  the  original  maximinimi¬ 
zation  problem  is  reduced  to  a  minimization  one,  namely  the  search  for  least 
favorables.  This  will  be  illustrated  extensively  in  the  next  chapters  with 
the  application  of  the  general  results  to  specific  filtering  situations. 

It  seems  convenient  to  underline  the  meaning  of  the  introduction  of  a 
game  (H,Q,M)  different  from  the  original  (K,P,M)  in  the  theorem.  The  key 
point  is  the  existence  of  a  game  with  saddle  point,  in  order  that  Lemma  2.2 
can  be  used.  On  one  hand,  the  introduction  of  the  set  Q  allows  the  solution 
of  problems  in  which  the  original  uncertainty  set  either  is  not  convex  or 


has  no  least  favorable.  Note  that  in  the  case  in  which  inf  sup  M(h,q) 

q  €  Q  h  €  K 

is  attained,  the  existence  of  a  rr'ust  filter  is  assured.  On  the  other 
hand,  the  introduction  of  the  set  H  allows  the  solution  of  the  robust 
filtering  problem  for  general  sets  of  filters  (class  theorems),  while 
dealing  with  easier,  restricted  sets. 

However,  the  most  relevant  remark  is  that  important  robust  filtering 
situations  (as  will  be  seen  in  the  next  chapters)  cannot  be  solved  by  using 
the  well-known  game-theoretic  minimax  theorems,  because  their  respective 
payoff  functions,  although  convex  in  the  uncertainties,  are  not  concave  in 
the  filter  sets. 

2.3  Uncertainties  separation  theorem 

Let  (H,Q,M)  be  a  game  in  which  Q  is  the  cartesian  product  of  independent 
uncertainty  classes 

Q  *  Qx  x  Q2  x  • • •  x  Qk 

and  the  payoff  function  can  be  put  as 

M(h,(qi,...,qk))  -  F(f1(h,q1),...,fk(h,qk))  (2.18) 

with  F:  ffi.  -*■  IR  non-decreasing  in  each  one  of  its  arguments. 

Define,  if  they  exist,  for  i  *l,...,k 

q.(h)  *  arg  min  f,(h,x)  (2.19) 

x  6  Qi 

Theorem  2.2 

a  sat^e  Point  of  (H,Q,M)  if  and  only  if  hL  is  solution  of 


the  equation 


14 


h 


L 


arg 


* 

max  MOi.Cq^h^) 
h  £  H 


,q*(hL» 


and 

*  * 
qL  *  (q1(hL),*",qk(hL)) 


Proof 


Only  if.  Suppose  (h^,q^)  is  a  saddle  point  of  (H,Q,M),  then 


M(hL,q  )  -  min  M(hL>q)  -  min^  M(hL«q) 
q€Q  qSQ 


for  all  Q  C  Q,  such  that  q^  €  Q.  In  particular  for  Qi  *  {qL1}  x  {qL2) 
x  x  . . .  x  {q  }  with  qL  *  (q^ , . . .  .q^)  .  This  J-eads  to 


nOVV  -  min  M(hL,(qL1,...,x,qL(  )t...,qLk» 

X  € 


Equivalently 


F(f^(h^,q^)  , .  •  • ,  f  j  (hT  ,qT  4  fj,  (hT  »qT  k.)) 


iv  L’^Li 


kv  L’^Lk' 


"  minn  F(fl(hL’qLl) 

x  e  q± 


,fi^hL,X^  *  * ’ ' ,fk^hL,qLk^ 


and  since  F  is  non-decreasing  in  the  i-th  argument 

* 

qLi  -  argmin  f^h^x)  -  q^h^ 
x  S  Qi 


Moreover,  if  (hL,qL>  is  a  saddle  point, 
h  -  argmax  M(h,q  ) 

“  1_  T?  *"* 


Then,  the  last  two  equations  result  in  (2.20). 


(2.20) 


(2.21) 


(2.22) 


(2.23) 


(2.24) 


(2.25) 


A. 
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If.  From  the  definition  of  q^(h^)  and  using  the  monotonicity  of  F  in 
every  argument  we  have  that 

M(hL,qL)  -  M(hL,(q*(hL),...,q*(hL>) 

-  min  M(h,q)  (2.26) 

q  S  Q 

Furthermore,  (2.20)  is  the  remaining  condition  in  order  for  (^L»qL)  to  be 
a  saddle  point  of  (H,Q,M). 

Corollary 

Suppose  the  game  (K,P,M)  satisfies  the  conditions  stated  in  Theorem 
2.1,  and  h^  is  solution  to  the  equation  (2.20),  then  h^  is  a  maximin  robust 
filter  for  the  game  (K,P,M). 

Proof 

ft  ft 

Letting  q^  *  (q^(hL) , . . . .q^Oi^)) ,  (hL><lL)  is  a  saddle  point  of  (H,Q,M) . 
Then  the  same  proof  of  Theorem  2.1  can  be  used. 

Remarks 

Although  Theorem  2.2  is  quite  apparent  (for  k  *  1  it  is  nothing  else 
than  the  definition  of  saddle  point),  it  represents  a  powerful  tool  in  order 
to  derive  conditions  for  least  favorability ,  and  in  order  to  check  candi¬ 
dates  for  saddle  points. 

This  result  is  especially  attractive  for  an  iterative  numerical  solution 
of  the  robust  filtering  problem,  i.e.,  for  a  given  filter  the  solution  of 
the  set  of  equations  (2.19)  can  be  computed  and  incorporated  into  equation 
(2.20)  which  in  turn  can  be  solved  for  a  new  filter.  Furthermore,  if  the 
uncertainty  subclasses  are  such  that  there  exists  an  analytical  solution  (in 
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terms  of  a  generic  h)  for  q^Ch),  i  *  then  every  uncertainty  can  be 

solved  separately  (as  if  all  the  others  were  constant),  getting  finally  the 
robust  filter  from  the  (also,  frequently  analytic)  solution  of  (2.20). 

One  particular  case  in  which  there  exists  an  explicit  solution  for 
the  least  favorable  and  that  can  be  found  repeatedly  in  the  next  chapters 
is  given  by  the  following  result. 

Lemma  2.3 

Suppose  that  the  uncertainty  subclass  is  a  subset  of  a  Hilbert  space  S 
and  is  some  neighborhood  around  a  nominal  element  described  by 

Q±  -  {*  €  5,  II x  -  xQ  II  <  A}  (2.27) 


where  the  norm  is  that  of  the  Hilbert  space  S.  Suppose  that  f^(h^,’):  S  -*■  1R 
is  a  continuous  linear  functional,  then 


q*(hL) 


xo  '  A 


z(h  ) 


(2.28) 


where  z(h^)  is  such  that  for  all  x  £  S 
fA(hL,x)  -  <  x,z(hL) > 


(2.29) 


Proof 

Since  f,(hT>*)  is  a  continuous  linear  functional  the  existence  of  an 

1  la 

element  z(hT)  €  S  fulfilling  (2.29)  is  assured  by  the  Riesz  representation 
theorem  (e.g.  [8]).  Note  that  for  all  x  S  Qi  the  Schwarz  inequality  results 
in 


j<x-x0,z(hL)>|  <  II  x  -  x0  II  II  zOi^)  II  <  A  II  z (hL)  II 


(2.30) 
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Also, 


z(h-) 

<  x  -  (xQ  -  A  ii"z'(h  J]j )  »  z(hL)  )  *  <  x-xQ,z(hL>  >  +  A||z(hL)  II  >  0  (2.31) 

Tj 

where  the  inequality  follows  from  (2.30).  Therefore  we  have  for  all  x  ^  Q± 

fi(hL’x)  >  fi(hL’  X0“A  llz(h^)  II )  (2,32) 

and  (2.28)  follows. 

2.4  Soft  minitnax  filtering 

Sometimes  there  exists  a  nominal  operating  point  q^,  such  that  our 
uncertainty  on  the  actual  operating  point  is  lessened  by  the  fact  that  points 
closer,  in  some  sense,  to  the  nominal  are  more  likely  to  occur  than  those 
more  distant  from  it.  This  can  be  incorporated  into  our  model,  modifying  the 
payoff  function  in  such  a  way  that  points  in  the  uncertainty  class  are  penalized 
according  to  some  distance  from  the  nominal  (D(q,qg)).  Hence  for  soft  minimax, 
we  have  the  payoff  function 

Mg(h,q)  =*  M(h,q)  +  D(q,q0)  (2.33) 

Note  that  now,  the  elements  closer  to  q^  are  stronger  candidates  to  be  the 
worst  case.  All  the  results  derived  before  can  be  applied  to  the  game  with 
payoff  function  Mg.  Directly  from  the  definition  of  regularity  we  can  see 
that  (h^,qL)  is  a  regular  pair  for  (H,Q,M)  if  and  only  if  it  is  for  (H,Q,Ms). 
Besides  it  is  not  unusual  that  the  distance  function  is  convex  in  the  uncer¬ 


tainties;  if  so,  condition  iii)  in  Theorem  2.1  is  also  equivalent  for  both 
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types  of  minimax  filtering.  However  the  existence  of  a  least  favorable  for 
(H,Q,M)  does  not  guarantee  its  existence  for  (H,Q,M  ),  even  when  D  is  convex. 
An  example  is  sufficient  in  order  to  verify  this. 

Consider  the  convex  set 

Q  -  {(xrx2):  0  <  xr  x2  <  1}U{(0,1)}U{(1,0)}<=  ]R2 

and  the  convex  functions  f(x)  *  x^,  g(x)  *  x2*  Obviously 

min  f(x)  *  f((0,l)) 
x  €  Q 

min  g(x)  *  g((l,0) ) 

x  €  Q 

But  there  exists  no  z  6  Q  such  that 


f(z)  +  g(z)  -  0  *  inf  { f(x)  +  g<x)} 


3 .  MATCHED  FILTERING 


3.1  Robust  Matched  Filtering  Theorems 

The  linear  system  that  maximizes  the  output  signal-to-noise  ratio  at 
some  instant  of  time,  when  the  input  is  a  deterministic  signal  embedded  in 
additive  random  noise  is  known  as  the  matched  filter  for  this  pair  of  signal 
and  noise.  If  the  noise  is  a  Gaussian  process,  then  the  output  of  this 
filter  in  the  instant  in  which  the  signal-to-noise  ratio  is  maximized  pro¬ 
vides  a  sufficient  statistic  for  any  likelihood  ratio  detection  test  of  the 
input  signal.  Since  the  power  of  the  noise  at  the  output  of  the  linear 
filter  depends  on  the  second-order  statistics  of  the  input  noise,  a  complete 
specification  of  the  signal  and  autocorrelation  of  the  noise  is  necessary 
and  sufficient  in  order  to  derive  the  corresponding  matched  filter.  Due  to 
modeling  uncertainties  or  changing  operating  environments  it  is  possible 
that  the  second  order  characterization  of  the  noise  is  not  completely  known. 
Also,  channel  nonlinearities  tend  to  distort  the  signal  in  an  unpredictable 
(or  difficult  to  ascertain)  way.  In  these  cases  it  is  interesting  to  design 
a  robust  matched  filter,  i.e.  a  filter  that  gives  the  optimum  -  in  some 
sense  -  behavior  within  the  uncertainty  region.  Poor  [3]  showed  that  under 
some  mild  restrictions  the  design  of  the  robust  matched  filter  in  the  maximin 
sense  -  the  most  widely  used  in  robust  decision  problems  -  is  equivalent  to 
finding  the  least  favorable  possible  pair  of  signal  and  noise.  In  this  chap¬ 
ter  we  derive  the  central  results  of  robust  matched  filtering  by  using  the 
theorems  proved  in  Chapter  2,  and  present  some  analytical  solutions  for 
particular  uncertainty  classes  in  the  increasingly  important  case  of  discrete 
time  processing  [4)  . 
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A  general  formulation  of  the  matched  filter  design  problem  that  allows 
the  description  of  the  input  pair  of  signal  and  noise  in  various  ways  has 
been  given  in  [3] .  Let  a  signal  quantity  (in  the  time  or  frequency  domain) 
be  s  €  X,  a  noise  quantity  (e.g.,  covariance  matrix  or  autocorrelation 
function)  be  n  €  3C,  and  a  filter  quantity  (e.g.  impulse  response  or  trans¬ 
fer  function)  be  h  £  X,  where  X  is  a  Hilbert  space  with  inner  product 
<  • ,  •  > and  S'  is  a  space  of  bounded,  linear,  (self-adjoint)  positive  operators 
mapping  X  to  itself.  The  real  valued  functional  defined  by 

p (h;s,n)  =  |<h,s>|2/(h,nh>  (3.1) 

represents  the  output  signal-to-noise  ratio  of  the  filter  at  some  time 

instant,  for  the  usual  descriptions  of  signal  and  noise,  either  in  continuous 

or  discrete  time.  Note  that  in  order  for  this  definition  to  make  sense,  the 

quantity  n  should  represent  a  second  order  characterization  of  the  noise. 

Therefore,  the  filtering  situation  defining  triple  is  (X,X x  X ,p) .  By  direct 

application  of  the  Schwarz  inequality  the  optimal  (matched)  filter  for  (s,n) 

* 

is  such  that  n-h  (s,n)  -  s.  Parallel  to  the  notation  in  (2.1),  we  write 

A  it 

P  (s,n)  *  sun  p(h;s,n)  3  <s,  h  (s,n)>  (3.2) 

h  £  X 

Now  suppose  that  the  signal  and  noise  pair  (s,n)  belongs  tc  an  uncertainty 
class  P.  (A  particular  case  is  when  the  uncertainties  in  signal  and  noise 
are  independent  in  which  case  P  =  S  x  N) .  Thus  we  have  the  game  (3C,P,p). 

A  pair  of  signal  and  noise  (s  ,n  )  £  P  is  said  to  be  least  favorable  for 
matched  filtering  if 

* 

(s  ,n  )  »  arg  min  P  (s,n) 

L  L  (s,n)€P 


(3.3) 
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Now  we  compute 

P  (<J,v)  -  p(hL;<r,v)  *  nr\vh~~  t<0»h*(°»v)>(hL»vhL>  ~  l<0»hL>|2]  (3.5) 

L  L 

The  numerator  can  be  expanded  in  the  expression 

[  (1  -  a)2  (  sL>h*(sL,v)  )  +  a2  <  s,h  (s,v)  )  + 

ot(l-a)(<  s,  h*(sL,v)>  +  <  sL,h*(s,v)  >)1  [<hL,vhL>]  - 

I  (l-a)2(<sL,hL>)2  +  a2|<s,hL>|2  +  2a(l  -  a)<  sL,hL>  Re<  s,hL>  ] 

*  (1  -  a)2  < sL,hL> [<  sL>  h*(sL,v) )  -  <  sL>hL>  1  + 

2a  (  sL»hL*  t  Re  (  s,h  (s^,v)  >  -  Re  <  s,hL>  ]  + 

a  < sL,h  (sL,v)>  ( <  hL,  nhL>  -  < hL,nLhL>)  +  o (a)  (3.6) 

* 

Where  we  have  used  that  h  (*,v)  is  self-adjoint.  The  first  term  of  the  right 
hand  of  (3.6)  is  given  by  (3.4);  therefore,  the  numerator  of  (3.5)  is 

(1  -  a)a  [<  sL»hL>  “  <  sL>h  (nhL,v) >  ]  <  sL,hL>  + 

2a  (  sT  ,h  >  l  Re  <  s,h  (s  ,v)  >  -  Re  <  s,h  >  ]  + 
a  <  sL>h*(sL>v) >  (<  hL,nhL>  -  <  sL»hL>)  +  o (a) 

*  a  [<sL,hL>  <sL,hL~h  (s^.u))  + < sL,h  (sL>v)>  <hL,nhL>  - 

<s^,hL>  < h*(s^,v) »nhL>  ]  +  o (a)  (3.7) 

Using  assumption  3,  the  term  in  brackets  goes  to  zero  when  a  4-  0,  therefore 
(3.5)  is  o (a)  and  consequently  (s^,nL)  is  a  regular  pair  since  (s,n)  is 


arbitrary. 
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Remarks 

For  the  sake  of  clarity  Che  theorem  is  not  stated  in  its  most  general 
form.  It  is  enough  that  the  uncertainty  class  P  has  property  i)  of  Theorem 
2.1.  With  respect  to  the  continuity  condition  required  by  the  theorem,  we 
have  the  following 


Lemma  3.1 


If 


°L 


is  invertible  then 

*  * 

lim  h  (sL«v)  *  h  (sL,nL) 


for  all  n  €  3f  such  that  (sL>n)  €  P,  where  v  *  (1  -  cOn^  + 


an. 


Proof 


I  v  x  II  *  II  (<1  -  0)0^  +  an)x  II  >  (1  -  a)llnL  xll  -  a  llnxl 
>  (l-a)lln^xll  -  all  nil llxll 


(3.8.1) 


where  the  last  inequality  follows  because  n  is  bounded.  If  n^  is  invertible 
(one  to  one  and  onto)  then  (Theorem  21.3  [21])  there  exists  e  >  0  such  that 
for  every  vector  x 


II  n^  x  II  >  t  II  x  II 

Now  fix  t  such  that  0  <  t  <  e/(e  +  II nil  ) .  Then  according  to  (3.8.1)  for 
every  a  €  [0 , t ]  and  every  x, 


II  v  xll  >  611x11 


(3.8.2) 


with  5  *  (1  -  t)e  -  til n II  >  0. 
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Since  v  is  self-adjoint  positive 
X  m  Null  (v)-1"  -  Range  (v) 

therefore,  the  range  of  v  is  dense.  This  fact  and  (3.8.2)  imply  that  v  is 
invertible  (Theorem  21.3  [21]).  If  we  let  x  ■  v  in  (3.82)  for  every 
vector  y  we  have 

II  y  II  *  II  vv  ^y  II  >  6 II  v  ^y  II 

and  therefore  for  every  a  £  [0,t] 


-1, 


sup 

y 


liij  <  l 

II  y  II  5 


(3.8.3) 


The  existence  of  v  (for  a  6  [0,t])  implies  that  the  following  is  true 


ie  k  —1  k 

h  (sL,v)  -  h  (sL,nL)  -  v  (nL~v)h  (sL>nL) 


Taking  norms 


Ih  (sL,v)  -  h  (sL,n^)ll  <  II  v  *  II  II nL  —  v  II  II h  (s^.n^ll 

<  a  •  h  II  nT  -  n  II  II  h  (s  ,n  )ll 
5  L  L  L 


(3.8.4) 


and  the  lemma  is  proved. 

Note  that, by  the  Banach  inverse  theorem  [9]  ,  since  v  is  linear  and 
continuous  its  inverse  (it  is  exists)  is  bounded.  However  this  would  not 
be  enough  for  the  previous  proof,  since  it  requires  that  v  ^  is  uniformly 
bounded  in  a  neighborhood  of  o  ■  0. 

The  conclusion  is  that  the  invertibility  of  is  sufficient  for 
condition  3  to  be  true  (via  Lemma  3.1  and  Schwarz  inequality). 
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Now  we  are  going  to  direct  our  attention  to  the  important  special  case 
in  which  the  uncertainties  in  signal  and  noise  are  independent  of  each  other 
and  therefore  P  *  S  *  W. 

Theorem  3.2 

(h^,(s^,n^))  is  a  saddle  point  of  (J C,S*N,p)  if  and  only  if 
l-  °LhL  *  SL 

2.  |<s,hL>|  >  |<sL,hL>|  Vs  e  s  (3.9) 

3.  <  hL,(nL-n)hL>  >  0  Vn  €  W  (3.10) 


Proof 

The  payoff  function  for  matched  filtering  (SNR),  (3.1),  can  be 
expressed  as 

P(h;s,n)  -  F(|<s,h>|,«h,nh>)"1)  (3.11) 

2  2 

with  F:  1R  -+■  R,  F(x,y)  »  x  y.  Consequently  the  uncertainties  separation 
theorem  can  be  applied  and  equations  (2.20)  and  (2.19)  result  in  conditions 
1,  and  2-3  straightforwardly. 

Remark 

In  [3]  a  similar  result  (for  convex  classes)  was  derived  with  conditions 
2  and  3  replaced  by 

2  Re  {<  s,hL>}  -  <  sL,hL>  -  <  hL>nhL>  >  0  (3.12) 

for  every  (s,n)  6  S*N.  In  (5)  it  was  shown  that  this  is  equivalent  to 
condition  3  and 

Re  {<  a,hL>}  >  <  aL,hL>  Vs  €  5 


(3.13) 
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It  is  interesting  to  note  that  if  S  is  convex  (Theorem  3.2  does  not 

put  any  restriction  on  S  and  N)  (3.9)  and  (3.13)  are  equivalent.  First, 

it  is  obvious  that  (3.13)  implies  (3.9).  Now  suppose  there  exists  s^  £  S 

such  that  Re{<  s, ,h_  )}  <  <  sT ,h_  >  and  consider  a  *  (1  -  a)s,  +  as, . 

1  L  L  L  Ot  L  1 

l<  VhL>l2  "  Id -<*)<  sL»hL>  +  ^  + 

[a  Im  <  s^.h^)  1  ^ 

*  (<sL,hL>)2  +  a2 (Re  (s^h^)  -  <sL,hL>)2  + 

2a  <  s^,hL>  (Re  ^  S1’^L^  ~  (  sL’^L^  + 

a2(Im < slthL>)2  (3.14) 

Denoting  A  *  (  s^.h^)  -  Re  <  s^,hL>  >  0,  for  every  a  such  that 

0  <  a  <  2A  <  sL,hL>/(A2  +  (Im  <  s^h^)2)  (3.15) 

2  1 2 

we  have  that  |< a  ,h  >|  <  |<  s  ,h  >|  .  therefore,  since  S  is  convex,  a  £  S 

Gt  L  L  L  (* 

and  (3.9)  does  not  hold. 

3.2  Application  to  Discrete  Time 

One  of  the  most  important  instances  in  which  this  problem  appears  is 
in  Direct  Sequence  Spread  Spectrum  systems.  In  this  case,  the  optimum 
linear  processor  is  an  analog  filter  matched  to  the  entire  codeword  wave¬ 
form.  However,  because  of  technological  or  versatility  considerations  it 
is  often  preferred  to  use  a  discrete-time  matched  filter  whose  input  can 
proceed  from  a  direct  sampling  of  the  received  codeword  or  from  the  sampling 

of  the  output  of  an  analog  filter  matched  to  the  chip  waveform. 

k  T 

In  the  above  formulation,  let  W  -  IR  ,  s  *  (Sq,...,s^^1  ,  h  * 
t  ~  - 

{Hq,  . . .  ,h^_1J  ,  where  hi  -  and  st*  hj.  are  the  values  signal  and 
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filter  impulse  response  respectively  at  the  i-th  sample.  The  inner  product 

T 

is  defined  as  the  usual  scalar  product:  (a,b>  *  a  b;  and  the  noise  des- 
criptor  is  n  S  Jf  C  ]R  ,  a  positive-definite  symmetric  matrix,  representing 
the  covariance  matrix  of  the  zero-mean  input  noise.  It  is  easy  to  see  that 
with  these  definitions,  (3.1)  gives  the  power  of  the  filter  output  due  to 
the  signal,  divided  by  the  variance  of  the  filter  output  due  to  the  noise, 
at  the  k-l-th  sample. 

Note  chat  in  this  (finite  dimensional)  case  the  noise  operator  (assumed 

* 

to  be  a  positive  definite  matrix)  is  always  invertible  (thus  h  (s,n)  exists 
for  every  pair)  and  through  Lemma  3.1  the  continuity  condition  of  Theorem 

3.1  is  always  satisfied. 

In  the  sequel,  we  will  deal  with  simple  and  general  uncertainty  classes 
in  order  to  illustrate  the  search  for  least  favorable  pairs. 

3.2.1  Signal  Uncertainty 

We  consider  here  two  classes  of  uncertainties  described  by  a  bound  on 
the  ^  and  norm  of  the  deviation  from  a  given  nominal  s^. 

(^2  uncertainty]  .  Let  ng  be  the  covariance  matrix  of  the  input  noise,  and 
the  class  of  allowable  signals  defined  by: 

Sx  -  {s  €  lRk,  ||  s  -  sQll  <  M  (3.16) 

The  least  favorable  signal  s^  depends  on  Uq  and  is  given  by  the  following: 

Proposition  1 

-1  2 

s  »  argmin  <  s,n  s  >  *  s.  -  o  h. 

L  c  0 

s  € 


(3.17) 
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Proposition  2 

The  -  nominal  -  filter  matched  to  (Sq^q)  is  robust  for  deviations 
from  Sq  defined  by  the  class  (3.16)  if  and  only  if  Sq  is  an  eigenvector 
of  nQ. 

Proof 

First  note  from  (3.1)  that  the  performance  of  filter  h  is  not  affected 
by  scaling  the  impulse  response  by  a  constant,  so  the  nominal  filter  is 
robust  if  and  only  if  there  exists  k  €  IR^,  such  that 

(nQ  +  a2I)"1s0  -  kn^SQ  (3.23) 

but  this  is  equivalent  to 

[(l-k)/o2]s0  -  n^  (3.24) 

and  since  n^1  is  invertible,  (3.24)  holds  if  and  only  if  is  an  eigenvector 
of  nQ. 

[i_  uncertainty] .  Let  S2,  the  class  of  allowable  signals,  be  defined  by 

Sj  *  (s  S  ]R^,  max|si  -  |  <  A,  i  *  0, . . .  ,k-l}  (3.25) 

In  order  to  get  the  least  favorable  signal  in  this  class,  according  to  (3.3) 
we  have  to  find  out  the  solution  to  the  minimization  problem: 

k-i  k-1 

argmin  I  2  (n  X).  s  s  (3.26) 

s  6  JQ  x  ...  x  Jk_x  i-o  j-0 

r 

with  ■  [Sq^-A,  Sqi  +  A]  .  In  some  special  cases  an  analytic  result  is 


achievable: 
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Proposition  3 

If  the  samples  of  the  noise  are  uncorrelated  the  least-favorable 
signal  sL  in  S 2  is  given  by: 


s 


Li 


0 

-A  <  s0i  <  A 

0i"A 

A  <  s0i 

0i+A 

S0i<“A 

Proof 


If  the  noise  samples  are  uncorrelated  ve  have: 


-1 


nQ  -  diag 


with  l i  >  0.  For  all  i«0,...,k-l,  it  is  easy  to  see  that: 


SiSLi  *  (aLl>' 


for  any  s  €  Since  hLi  *  £  s^,  this  implies: 


<  s  ,h  >  *  min  <  s,h  > 

L*  L  o  L 

s  e  s2 


Therefore,  by  Theorem  3.2,  is  the  least  favorable  signal. 


Proposition  A 


If  there  exists  an  element  sL  G  5^  such  that 


Li 


soi-4  I£hu>0 

S01+A  1£hLi<0 


with  h^  *  no^sL*  then  sl  *s  *east  favorable  signal. 


(3.27) 


(3.28) 


(3.29) 


(3.30) 
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Proof 

The  expressions  (3.25)  and  (3.30)  imply  that  for  any  s  €  S2  and 
i *  0, . . . ,k-l. 


Li  i 


hLiSLi 


(3.31) 


Since  this  is  sufficient  in  order  to  have  (3.29),  s  is  the  least  favorable 

la 

signal  in  S 

Note  that  this  proposition  suffers  from  the  common  inconvenience  of 
the  results  derived  from  the  uncertainties  separation  theorem,  namely  s^ 
depends  on  h  ,  and  therefore  the  solution  must  be  reached  recursively  except 

la 

for  special  cases.  However,  we  can  assure  the  existence  of  a  solution  of 
the  type  of  (3.30)  and  its  direct  computation  under  the  condition  of  the 
following: 


Proposition  5 

If  the  maximum  deviation  from  the  nominal  signal  in  each  sample  is 
bounded  by: 


A  <  min|hrt. I /max  £ 


Oi ' 


j 


m 


S^jm1 


(3.32) 


the  least  favorable  signal  s^  in  S ^  is  given  by 


s  -  A 

if 

h_ . 

>  0 

3  .  m  * 

Oi 

Oi 

Li 

S0i  +  A 

if 

h0i 

<  0 

where  h^  ■  n^s^  is  the  nominal  matched  filter. 


(3.33) 


With  h. 


n0lsL  and  UL 


SL"S0> 


for  i »  0, . 


,k-l 


Proof 
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|hU-h01l  ■  <3-34> 

since  6  5^,  the  absolute  value  of  the  components  of  u^  is  bounded  by  A. 

Thus 


and  by  (3.32) 


(3.35) 


V  < 


Oi1 


which  is  sufficient  in  order  that 


(3.36) 


sgn(hL±)  *  sgn(hQ1)  (3.37) 

Therefore,  by  Proposition  4,  s^  given  by  (3.33)  is  the  least  favorable  signal 
in  S2. 

3.2.2  Noise  Uncertainty 

Here  we  suppose  that  the  nominal  signal  Sq  is  truly  present  at  the 
input,  but  the  actual  covariance  matrix  n  is  allowed  to  differ  from  the 
nominal  n^.  The  first  is  a  general  result  useful  for  different  classes  of 
uncertainties. 


Lemma  3.2 

n^  is  the  least  favorable  noise  for  every  nominal  signal  s^  S  1R  if 

•k 

and  only  if  n  is  a  maximal  element  of  N  (the  uncertainty  class). 

la 

Proof 

Since  there  is  no  uncertainty  in  the  signal  we  only  need  to  prove 

It  is  supposed  that  N  does  not  depend  on  s^. 
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conditions  1  and  3  of  Theorem  3.2. 

By  definition,  n^  is  a  maximal  element  of  W,  if  and  only  if 

\  >  n  (3.38) 

for  all  n  €  W.  But  this  is  equivalent  to  (3.10)  holding  for  all  s^  £  ]Rk 
since  for  every  h^  6  It  there  exists  sQ  -  n^h^. 

As  an  application  of  this  previous  Lemma,  we  have  a  result  on  the 
least  favorability  of  white  noise: 

Proposition  6 

Suppose  n^  *  cl  €  W,  this  is  the  least  favorable  covariance  in  N  if  and 
only  if  llnll  ^  <  c  for  all  n  €  N. 

Proof 

According  to  Lemma  3.2,  cl  is  the  least  favorable  covariance  in  W  if 
and  only  if  it  is  the  maximal  element  of  the  class,  i.e.  for  all  n  6  W, 
x  €  ®kf  x  +  0: 

xT(ng-n)x  ■  cllxll^  -  xTnx  >  0  (3.39) 

(xTnx) /II xl! ^  <  c  (3.40) 

Equivalently,  by  Rayleigh's  principle  (6], 

P(n)  <  c  (3.41) 

where  p (n)  denotes  the  spectral  radius  of  n  (maximum  absolute  value  of  its 
eigenvalues),  but  llnll 2  *P(n). 


II nil 2  ■  Operator  norm  of  n. 
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Next,  in  analogy  with  the  t ^  signal  uncertainty,  we  deal  with  a 
specific  deviation  class  defined  by 

*  {n  €  II  n  -  Hq  II  <  e,  n  >  0}  (3.42) 

where  e  is  a  positive  constant,  Uq  is  the  nominal  noise  covariance  matrix 
and  the  norm  is  any  valid  matrix  norm. 

Proposition  7 

The  least  favorable  noise  in  is 

n^  *  Hq  +  el  (3.43) 

Proof 

By  means  of  Lemma  3.2,  this  is  equivalent  to  proof  that  nL  is  a  maximal 

1c 

element  of  For  any  n  €  and  x  €  ]R  : 

xT(nL~n)x  ■  xT(nQ  +  eI-n)x  »  ellxll^  +  xT(n^-n)x  ;  (3.44) 

but  by  the  Schwarz  inequality 

|xT(n0-n)x|  <11x11  II  (nQ  -  n)x  II  (3.45) 

<  llxll2lln0-nll  ,  (3.46) 

where  the  last  inequality  must  hold  f<  r  any  type  of  matrix  norm  [7] .  Combining 
(3.44)  and  (3.46)  we  get 

x^(n^  -  n)x  >  llxll^  (e  -  II  Hq  -  n  ID  >  0  (3.47) 

Finally,  it  is  easy  to  see  that  S  since  for  any  x  £  1R  ,  x  4  0: 
x^n^x  *  x^n^x  +  ellxll^  >  0 


(3.48) 
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Note  that  the  expression  <h,nh>  can  be  put  as  the  inner  product  of 
T 

matrices  [n,hh  ]  defined  by 


[a,b]  *  tr  {a  •  b^} 


(3.49) 


Then  other  uncertainty  classes  can  be  solved  analytically  via  this  inner 
product.  The  reader  is  referred  to  Section  4.5. 


3.2.3  Uncertainty  in  Signal  and  Noise 

In  this  section  we  suppose  that  we  receive  some  signal  s  €  S  and  some 
noise  with  covariance  n  S  N.  We  must  seek  for  a  least  favorable  pair  (s^,n^) 
that  fulfills  conditions  2  and  3  of  Theorem  3.2.  If  S  and  N  are  such  that 
analytical  solutions  for  the  least  favorable  signal  and  noise  are  available, 
we  get  a  set  of  two  equations  in  two  unknowns:  and  n^,  whose  solution 

exists  if  and  only  if  there  exists  a  least  favorable  pair  in  (S,M). 

Further  simplification  is  obtained  when  one  or  both  of  the  equations 
gives  s^  and/or  directly,  i.e. 

(signal).  As  an  example  consider  the  case  in  which  S  =>  (3.16)  and  M  * 

(3.42).  Recall  that  the  least  favorable  signal  and  noise  were  given 
respectively  by: 


SL  does  not  depend  on  the  input  noise 


SL  *  S0  -  hL 


“L  ”  n0  +  £l 


Therefore  the  robust  matched  filter 


N.  *  "L  SL  is 


hL  "  (n0  +  +  oWS 


(3.50) 


2 

with  o  ||h.  II  ■  A.  Note  that  if  the  nominal  noise  is  white,  the  nominal  matched 

la 

filter  (hg  •  Sg)  is  robust  for  uncertainties  in  signal  and  noise  defined  by 


and  respectively. 
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3.2.4  Signal  Selection 

In  this  section  we  address  the  point  of  optimum  signal  design  for 
matched  filtering  in  the  presence  of  uncertainties.  We  will  assume  (as 
is  generally  the  case  in  signal  selection)  that  the  power  of  the  signal 
to  be  selected  is  constrained.  When  the  incoming  signal  and  noise  axe 
known  and  are  processed  by  their  matched  filter,  it  is  well  known  that 
the  signal  that  maximizes  the  SNR  is  the  minimum-eigenvalue  eigenvector 
of  the  noise  covariance  matrix.  However,  as  we  will  see,  when  the 
received  signal  is  deviated  from  the  one  sent  this  is,  in  general,  no 
longer  true. 

In  essence,  the  problem  of  signal  selection  under  uncertainties  is 

s  *  arg  max  max  inf  p(h;s,n)  (3.51) 

m  Sq.IIsqII-c  h  €  X  (s  ,n)  6  S(sQ)x  W 

where  in  order  to  simplify  matters  we  have  assumed  that  the  noise  and 
signal  uncertainty  classes  are  independent,  and  that  the  signal  uncertainty 
class  is  the  deviation  from  a  given  nominal  (like  the  models  treated  in 
3.2.2).  It  is  clear  that  if  the  signal  is  known,  it  has  to  be  designed  to 
give  optimal  SNR  with  the  least  favorable  noise,  i.e.  it  must  be  its  minimum- 
eigenvalue  eigenvector.  If  the  noise  is  known,  we  investigate  the  signal 
selection  problem  for  the  same  uncertainty  classes  that  we  did  in  3.2.2. 

In  any  case  (3.51)  results  in 

min  (  s ,n  ^s  > 
seS(sQ) 

<hL(s0),nhL(s0)>  .  (3.52) 


s  =*  arg  max 

Vllso"’c 


arg  max 

so’llso,|MC 
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i)  l„  uncertainty.  Recalling  <3.22),  we  have 

<hL(s0),nhL(s0)>  -  sJ(n  +  o2l)_1n  (n  +  a2I)"1S()  .  (3.53) 

It  is  easy  to  prove  that  x  is  eigenvector  of  n  with  eigenvalue  X ,  if  and 

2-1  2-1 

only  if  x  is  eigenvector  of  (n+o  I)  n  (n+o  I)  with  eigenvalue 
2  2 

X/(X+a  )  .  Then,  one  tends  to  think  that  the  optimum  Sq  is  the  minimum- 

2-1  2-1 

eigenvalue  eigenvector  of  (n  +  c  I)  n  (n+c  I)  (and  therefore  an  eigen¬ 
vector  of  n) ;  however,  this  is  not  necessarily  the  case,  because  o  depends 

on  Sq.  Let  be  a  family  of  orthonormal  eigenvectors  of  n. 

k 

Then  with  Sq  -  £  a^^ 


s^n+a2!)"^  (n+a2I)-1s0  »  £  X  a*/(X.  +a2)2  (3.54) 

i«l 

dh  (s  )  II  -  (  I  a2/(X  +02)2)*5  (3.55) 

i*l 


So  (3.52)  results  in  the  following  nonlinear  optimization  problem: 

^  2  2  2 

arg  max  £  X  a,/(X  +  a*;  (3.56) 

{ajL}  i-1  1  1  1 

subject  to 

Vr 

4  _  2,„  2.2  .2 

a  £  a./(X  +<j  )  -A  (3.57) 

i*l 

k  2  2 

£  a  -  c  (3.58) 

i-1  1 

ii)  £_  uncertainty.  When  the  noise  is  uncorrelated  we  obtained  the 
explicit  solution  for  the  least  favorable  signal  in  (3.27).  Applying  this 
expression  to  (3.52)  we  get 


38 


m 


*  arg  max  SM(sn) 

so’llso"“c 


SM<s0>  "  <SL(S0)‘  n"lsL(s0)> 


\.ir  * 5r<*oi+i)2 

s01>i  1  soi<-i  1 


(3.59) 


where  (X is  the  set  of  eigenvalues  of  n. 


Proposition  8 

The  optimal  nominal  signal  for  robust  matched  filtering  when  the 
noise  has  a  known  diagonal  covariance  matrix  and  the  uncertainty  on  the 
received  signal  is  modeled  by  an  class,  is  a  minimum-eigenvalue  eigen¬ 
vector  of  the  noise  covariance. 


Proof 

We  suppose  implicitly  that  the  signal  uncertainty  is  small  compared 

2  2 

with  its  power  (specifically  kA  <  c  ) .  First  we  show  that  if  Sq  is  such 
that,  for  some  i,  0  <  j  Sq^  I  <  A,  there  exists  s^  such  that  II  II  *  II  s^  II  *  c 
and  SM(Sq)  >  SM(Sq) .  Let 


Oj 


Oj 


✓  2  2 

s‘  + 

Oi  Om 


i  *  i 

j  +  i,m 


(3.60) 


m 


with  sQm  *  max  { Is^l  }  >  A 


SM(s0)  -  SM(s0)  -  j-  f  (Vs0i 
m 


1  i  ,  / 2  2  .2.2 

r~  f  (vsnj  +«n_  -  A  ) 


Om 


<|s0il  -A)‘]  >  0 


I 


(3.61) 
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Also,  if  Sq  is  such  that  Sq^  >  A  with  X^  >  X^  *  min  {X^},  there  exists 

*  *  *  * 

Sq  such  that  II  11  ■  II  s^  II  ■  c  and  SM(Sg)  >  SM(Sg).  Let  Sg  be  defined  by 

(3.60)  replacing  m  by  p. 


“*<•!!>  -  “<•<>>  .5J-KI.*  |-i)2-(|a0  |-i)2|-f  (Is 
P  i 


Oi 


~  A)  ‘ 


>ir12i<ls0pl  +  1*01*  -  I’op1'  '  i2) 


(3.62) 


Now  consider 


Op  1 


2  s 


Oi  1  Op 1 


A(  s 


Op 


+  s 


Oi1 


-  (e  +5  )  A  +  2  e  6  >0  , 


(3.63) 


where  e  *  ls0iJ  "  d,  5  »  (sg^l  -  A  are  positive  by  supposition  (otherwise 

*  .  1  1  1  A 

(3.61)  shows  already  that  sQ  is  not  optimal).  Since  |  sQp  I  I  sqi  l  >  "2* 

(3.63)  implies  that  the  right  side  of  (3.62)  is  positive,  and  thus  we  get 

the  desired  result.  Therefore,  we  have  shown  that  the  only  Sg  for  which 

there  does  not  exist  a  different  signal  with  better  SM  is  the  one  having  all 

the  samples  corresponding  to  eigenvalues  greater  than  the  minimum,  equal 


to  zero. 
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4.  LINEAR  OBSERVERS  AND  REGULATORS 


4.1  Introduction 

Undoubtedly,  modern  control  Is  one  of  the  areas  In  which  the  minimax 
approach  to  problems  with  uncertain  models  has  flourished  more  markedly. 
Specifically,  the  great  practical  Impact  of  Kalman  filtering  has  resulted 
in  a  sizable  portion  of  technical  literature  related  to  the  design  of  optimal 
observers  for  systems  whose  noise  statistics  are  not  accurately  known.  How¬ 
ever,  up  to  now,  the  problem  has  not  been  treated  with  full  generality;  the 
available  results  are  related  either  to  the  steady-state  case  or  to  parti¬ 
cular  classes  of  uncertainties.  Moreover,  the  main  relevance  of  this  appli¬ 
cation  of  our  general  results  for  minimax  robust  filtering,  stems  from  the 
fact  that  the  payoff  functions  that  we  encounter  in  these  filtering  situations 
are  not  concave- convex  and  thus  the  use  of  the  conventional  theorems  of  game 
theory  is  not  possible  (we  will  show  that  the  often  quoted  treatment  of 
D'Appolito  and  Hutchinson [19 ]  that  relies  on  those  results,  is  incorrect). 

As  in  the  other  cases,  two  kinds  of  results  will  be  presented,  first  a 
minimax  theorem,  that  gives  a  saddle  point  solution  to  the  corresponding 
game  by  showing  that  the  minimax  equality  holds,  and  thus  the  solution  is 
the  optimal  filter  for  the  least  favorable  operating  point,  and  second  a 
procedure  to  find  these  least  favorable  situations  for  general  uncertainty 
classes.  In  order  not  to  be  overly  repetitive,  our  emphasis  will  be  in  the 
discrete-time  case,  presenting  the  main  results  in  continuous-time  as  well. 

We  will  analyze  the  following  filtering  situations: 

i)  the  linear  observer  problem  for  predictor  state  estimation 

(Vk-i) 
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ii)  the  linear  observer  problem  for  filter  state  estimation 
iii)  the  regulator  problem  for  linear  quadratic  optimal  control. 

Finally,  several  discrete-time  examples  will  illustrate  the  search  for 
least  favorables  for  general  uncertainty  classes  of  covariance  matrices 
of  practical  interest. 

For  convenience,  we  repeat  in  the  form  of  lemmas  the  results  from 
Chapter  2  that  will  be  used  in  this  chapter. 

Lemma  1 

Suppose  that  5(h,*)  is  concave  in  Q  for  every  hex*. 

* 

If  a  least  favorable  operating  point  —  for  (Q,X  ,5)  —  q  ,  and  its 

then  h^  is  a  robust  filter  for 

(P,K,S). 

Lemma  2 

Suppose  that  Q  is  a  cartesian  product  of  sets,  Q  -  X  , , .  x  and 
that  the  penalty  function  can  be  put  as 

5(h,(q1 . qfe))  -  F(f1(h,q1),...,fk(h,qk))  > 

with  F  nondecreasing  in  each  one  of  its  arguments.  Denote,  for  i  *  l,...,k 
* 

q.(h)  -  arg  max  f (h,q)  , 

’ie«i 

If  is  a  solution  of  the  equation 

hL  -  arg  aia_*  5(h,(qJ(hL),...,qJ(h^))  ,  (2.20) 

h  2  7C 

then  h^  is  a  robust  filter  for  (P,X,5). 


optimum  filter  h^,  form  a  regular  pair. 
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Lemma  3 

it  it 

Under  the  suppositions  of  Lemmas  1  and  2,  q^  «  (q1 (h^) , . . . ,qk(hL) )  is  a 

* 

least  favorable  operating  point  —  for  (Q,3f  ,5)  —  if  and  only  if  h  is  a 

Lt 

solution  to  (2.20). 

4.2  Robust  observer  problem 

Consider  the  following  linear  discrete-time  system 

Vi  •  Vk  +  \\  +  \  *  \  •  ko‘k<  "-1  <4-l> 

z  “  SA  +  vk  +  \  ’  kc  <  k  <  N-l  (4.2) 

with  an  nX  1  state  vector,  u^  an  mX  1  control  vector  and  an  rX  1 
output  vector.  The  initial  state  (x^  )  is  a  random  vector  with  mean  m 
and  variance  and  (w^  +  w^) ,  (v^  +  v^)  are  random  sequences  independent 
of  the  initial  state,  representing  the  process  and  observation  noises  with 
means  and  and  covariance 


cov  ( 


(4.3) 


We  suppose  that  the  means  and  the  covariance  matrices  are  known  only  to 

belong  to  some  uncertainty  classes,  col(mQ,wk  ,,',,wN-2’vk  *  * • * » ^—2^  * 

N— 1  0  ° 

y  e  M,  ^’^k^k’^k-k  “  A  €  C’  such  that  "k  and  2o  are  Positive 

o 

semidef inite,  ©k  is  positive  definite  and  is  congruent  with  2^  and  0^. 
In  the  presence  of  these  uncertainties  the  optimal  observer  and  regulator 
problems  will  be  solved  in  a  minimax  sense,  i.e.,  our  goal  is  to  find 

h_  *  arg  min  sup 

K  h  €  X  (y,A)€  MxC 


5(h,(y,A)) 


(4.4) 


I 


I 

I 

I 
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I 

I 

I 

I 

I 

I 

I 


where  3C  is  some  class  of  allowable  filters  and  £(-,*)  is  a  penalty  function. 
In  order  to  solve  this  game  (\i*C,X,6)  we  will  apply  the  general  results 
of  Chapter  2. 

1.  Formulation 

The  optimal  linear  observer  of  the  state  of  (4.1)  is  the  solution  to 

K  E|V,  qk  v,1",'1,1  ■  (4-5) 


where  JC  is  Che  sec  of  linear  filters  of  appropriate  dimensionality  with  state 

W  *k/<l  4  \  '  W  %  "  \ . V*  Zq  ■  \ . V  “d  Qk  ls  * 

positive  semidefinite  weighting  matrix.  When  the  second  order  statistics 
of  the  stochastic  system  (4.1-3)  are  known, the  solution  for  every  time  k  is  _ 
given  by  the  Kalman  filter;  the  cases  of  most  general  interest,  i.e., 
q  *  k-1,  (one-step)  predictor  estimate,  and  q*k,  filter  estimate,  are 
given  by  (e.g.  [15])  recursive  formulas  of  the  form 


\+l/k  “  Ak  *k/k-l  +  Bk  \  +  WLk  +  ^k^zk  *  VLk’Ck  \/k-l}  ’ 

\/k  *  \/k-l  +  Ek(*k“  VLk"Ck  \/k-l)  » 

\  /k  -1  °Lo 
o  o 


Subtracting  (4.6)  from  (4.1)  and  from  both  sides  of  (4./)  we  get 
error  dynamical  equations  for  arbitrary  K^,  E^,  ra^,  w^,  v^: 


(4.6) 

(4.7) 

(4.8) 


the 


ek+i/k *  Vk-rVk +  \ +  •  (4-9) 

*k/k  *  (I  "  W  ek/k-l  ~  Ek\  '  ^^k  "  vLk)  *  (4.10) 

•k  /k  -1  “  *k  “  °Lo  * 

OO  0 


(4.11) 


Denoting  the  error  variances  by  2^^  *  var  [e^^l  >  the  vector  of  means 
in  the  estimator  by  y^  (defined  analogously  to  y) ,  and  the  state-transition 
of  the  fundamental  matrix  (A-KC)  by  *(»,•)  and  taking  into  account  that 
the  white  sequences  w^,  v^  are  independent  of  (4.9-11)  lead  to 

T  ,  ^.a  .  ^  _ _  „  ,  .  T 


"Vw  Vk-i'  ■  :k/k-i  +\(lW(y-yLHy-!'I.)  <W  * 


T^T 
k 


k-1 


Sk/k-l  *  *(k’ko>20  *  <k»k0)  +  2  *0c.J+l>  M  *r(k,j+l) 

J‘ko 

Mk  -  (  I  : 


1  ❖ 

[ 

I 

k  k 

'ffj  e 

1 

< 

k  k 

1 

_  — 

(4.12) 

(4.13) 

(4.14) 


\  m  l*<k»k0>  :  :  <,>(k’k)  :  °nx  (N-l-k)n1 

0 


(IK) 


nX  (N-l-ko)r 


(N-k  )nl......°..i^_ 


“V: 


■JVk  +  V^tH^L>T,![  - 


V* 

“'k/k 


Fk-H-Ekck]ek  W-^Io 


•  I, 


JrX  (N-K  )n’  (N-l-K)r 
0  0  o 


(4.15)  - 


(4.16) 


(4.17) 

(4.18) 

(4.19) 


Here  the  subindices  of  the  null  and  identity  matrices  indicate  their 
respective  size. 

2.  Uncertainty  in  Covariances 

We  suppose  now  that  the  means  are  known,  l.e.  y*y^.  and  therefore  that 
the  estimates  are  unbiased.  By  the  definition  of  the  state-transition  matrix 
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it  is  easy  to  check  [16]  that  S®  satisfies  the  equation 

“k+l/k  *  Vic  1  2k/k-l  [  *k  "  Vk  ^  +  \  • 


(4.20) 


The  optimal  gains  are  given  by 


Kfc  “  arg  „  “T*  nXr  “k/k-1 


min 

\s«“ 


”  [  \  2k/k-l  Ck  +  \]  1  Ck  “k/k-1  Ck  +  ®kJ  ^  (4,21) 

2k+l/k  "  2k/k-l  \  +  ik  ' 

1  2k/k-l  Ck  +  V  <  Ck  sk/k-l  CJ  +  ek  rt  Afc  Vk-i  Cl  +  V"  > 

(4.22) 

2.  f.  -  m  2  # 

k/k-1  o  9 
o  o 

h  '  “8  .  tVx  r  ^/k  '  Ck  1  ^  W  °k  +  V*1  ’  <4'23> 


where 


Sk/q  -  la 


'  k/’ 


(4.24) 


can  be  precomputed  since  it  does  not  depend  on  the  observations. 

Since  all  the  matrices  involved  are  possibly  time-varying,  in  the  remain¬ 
der  of  the  discussion  we  will  drop,  for  convenience,  their  explicit  dependence 
on  time  except  when  this  could  be  ambiguous. 

In  order  to  prove  the  minimax  theorems  we  will  make  use  of  the  following 
sensitivity  result. 

Lemma  4 

Suppose  C'  is  a  convex  set  and  (Z,*,9,S  )  -  A  -  (1  -  a)  Ar  +  aii  with 

o  L 

'Y  "  ^eL,'*L’®L’2oL^  ^  "  (V»X,Y,Z)  both  belonging  to  C* .  Let  xjj^  be 
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the  state  estimate  of  the  Kalman  filter  designed  for  when  A  is  truly 
present.  With  5k/q  »  \~\/q  and  Tk/q  £  E  [  i^/q  S£/q  ]  ,  we  have  that 
for  all  €  C*  and  time  k,  and  for  q *  k,k-l 


lim  I  (V  _  r  \ 
a-*-o  a  "k/q  k/q 


0 


(4.25) 


Proof 

"k/k-1  an<*  ^k/k-l  satlsfy  ^nations  (4.14)  and  (4.20)  for  the 
*  *  , 

optimal  Kalman  gains  K  and  K^  designed  for  A  and 
that  in  both  cases  the  same  set  of  covariances  (S,'!',©,^)  is  present. 
Therefore,  we  have 


respectively.  Note 


Sk+l/k  '  rk+l/k  *  l  A  -  \c  ]  (“k/k-1  "  rk/k-l}  [  A  ‘  *LC  1 

*  *  T  *  *  T  *  *T  *  *T 

-  *(K  -O  -  (K  -IL)  *  +  K  «  -ILSI^ 

-  «**<>  c  “k/k-i  aI“a  sk/k-l  cV-<)T 
“  V  Sk/k-l  Cl  AT  +  K*  C  £k/k-l  Cl  K*T 

-'A“Vi  <2k/k-i“rk/k-i)[A-<c,T 

-  (K*-b£)  [cJk/w  cT  +  ©]  (K*-k£)T  (4.26) 


where  in  order  to  get  the  last  equality  the  expression  for  K  (4.21)  has 
been  taken  into  account.  It  is  worth  noting  that  (4.26)  gives  the  varia¬ 
tion  of  the  error  variance  due  to  an  arbitrary  modification  (up  to  here 
* 

the  value  of  K^  has  not  been  used)  of  the  optimal  Kalman  gain  (cf.  [17]). 

Because  lim  —  [K  -KjJ  exists  for  every  time  k  and  A^,fi  €C'  (see  Appendix  1) 
a  ■*  o 

in  the  limit  (4.26)  reduces  to  a  homogeneous  equation,  and  since  ■ 

£o  *  rk  /k  -1  ”e  hAV* 
o  o 
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lim  1 

a  -*■  o  a  "k/k-1 


rk/k-l} 


(4.27) 


Using  (4.12)  for  E  and  E^, 
and  Che  expression  (4.23)  for  E  ,  a  series  of  manipulations  result  in 

Sk/k  *  Pk/k  -  1 '-k/k-1 (I'<C|1 

-  (E*-e£)  fc  2k/k_1  CT  +©]  (E*-e£)T  (4.28) 

Now,  taking  lim  —  [•!  of  both  sides  of  (4.28),  the  first  term  of  the  right 

OL-f  Q 

side  is  zero  as  a  consequence  of  (4.27)  and  the  second  term  vanishes  as 

1  *  * 

well,  because  lim  —  (E  -  E^)  exists  as  can  be  shown  similarly  to  Appendix  1. 
o  o 

This  ends  the  proof  of  the  Lemma. 

If  when  the  second  order  statistics  are  uncertain,  we  use  the  weighted 
square  error  (4.5)  as  the  penalty  function,  inasmuch  as  the  history  of  the 
least  favorable  statistics  can  be  different  for  every  k,  it  is  possible 
that  the  minimax  filter  is  given  by  a  recursion  chat  does  not  solve  the 
problem  for  previous  times,  therefore  we  are  forced  to  huild  a  different 
filter  for  every  k,  running  from  kQ  to  k.  The  inapplicability  of  this 
solution  leads  to  the  consideration  of  the  penalty  function  (cf.  [13D 


again  the  gains  designed  for  A  and  A.  respectively. 


L(H,y,A) 


(4.29) 


that  when  the  statistical  model  is  known  has  the  same  minimizing  filter  as 
(4.5).  Furthermore,  note  that  the  penalty  function  of  (4.5)  can  be  con¬ 
sidered  a  particular  case  of  (4.29). 

We  assume  that  C  is  an  uncertainty  class  such  that  there  exists  a  convex  set  C'  DC 
that  fulfills  (2.13)  achieving  the  inf  sup,  i.e.  with  least  favorables.  Given  that  the 
means  are  known  and  JC  is  the  set  of  linear  filters ,  the  following  minimax  theorem  holds . 
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Theorem  1 

The  Kalman  filter  for  the  least  favorable  collection  of  covariances 
for  (predictor/filter)  state  estimation  is  robust  for  the  game  (C,3C,L). 

Proof 

First  consider  the  class  3C  of  linear  filters  of  the  form  (4.6-8)  for 
K,E  €RnXr  and  with  matrices  A,B,C  given  by  the  original  system  (4.1-2). 
Then  the  penalty  function  is 


L<li-A) '  It  tr(Qk 

o 


(4.30) 


with  given  by  (4.13),  (4.14),  (4.18)  for  q**k,k-l.  Now  we  check  that 

the  sufficient  conditions  of  Lemma  1  hold  in  this  case:  i)  L(K,A)  is 

linear  in  2,  'P,  9,  2  and  thus  concave  in  C;  ii)  it  follows  from  Lemma  4 

o 

that  every  element  in  C'  forms  a  regular  pair  with  its  associated  Kalman 

filter;  and  iii)  since  the  Kalman  filter  (optimum  over  X)  for  a  given  A, 

* 


belongs  always  to  the  class  X  we  have  condition  ii)  of  Theorem  2.1  for  any 

set  of  covariances.  (Note  that  if  x^  is  a  gaussian  random  vector  and  the 

o 

process  and  observation  noises  are  jointly  gaussian,  we  can  drop  the  restric¬ 
tion  of  linearity  for  the  class  X  in  the  Theorem.)  Thus  the  assumptions  of 
Lemma  1  are  satisfied  and  Theorem  1  follows. 


Remark 

* 

The  last  proof  allows  us  to  notice  the  role  of  the  class  X  in  Th.  2.1 
Thanks  to  the  previous  restriction  to  the  set  of  filters  with  given  A,  B,  C 
matrices  we  were  able  to  prove  the  concavity  in  the  uncertainties  of  the 
penalty  function,  while  the  theorem  holds  for  the  general  class  of  filters. 
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(This  restriction  of  the  set  of  filters  has  been  previously  justified  by 
the  convenience  of  an  unbiased  estimate  [131  -  However,  this  is  not  enough 
to  prove  the  optimality  of  the  solution  over  all  possible  linear  estimators.) 

Theorem  1  has  reduced  the  original  minimax  filter  design  problem  to 
the  search  for  least  favorable  sets  of  covariances.  In  this  point  the 
application  of  Lemma  3  has  proven  to  be  successful  in  dealing  with  impor¬ 
tant  uncertainty  classes  in  other  contexts  [51.  In  the  presen?  case,  let 
us  define 

a  S*1  T 

W.(R)  -  2  *A(k,j)  R.  *.(kfj)  (4. 3D 

1  k-j  L 

where  R^  €  ]R  n*n  (if  R  >  0  then  W(R)  >  0),  and  *L(*,*)  is  the  state  transi¬ 
tion  matrix  of  the  fundamental  matrix  (A  -  K^C) .  Using  this  definition  and 
interchanging  the  order  of  summat ion,  (4.13)  and  (4.18)  result  in 


X  tr(<!k  “k/k-l}  ‘  tr(S»  “k  «>  +  X  Vl®” 

K*k  o  j ** 


(4.32) 


N-l 


2  tr{Qk  2k/k}  -  tr{2o  wk  ( [i  -  v  r  q  [i  -  v  i) 


k-k 


N-2 


+  2  Mj  Wj+]_  (  [I  -  E^C  J  Q  [I  -  E^C  ]) 


N-l 

+  2  ^kWkV 

k«k 

o 


(4.33) 


Applying  Lemma  3  to  these  expressions,  Theorems  2  and  3  follow  straight¬ 
forwardly. 
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Theorem  2 

If  C'  can  be  put  as  the  cartesian  product  of  uncertainty  classes 
X  x  P  x  v  x  S,  then  (ST ,9. ,2  . )  is  the  least  favorable  collection  of 

L  L  L  OL 

covariances  for  predictor  state  estimation  if  and  only  if  the  following 
are  satisfied 

N-2 

— T  *  arg  max  tr{  2  2  (4.34) 

L  2  €  X  j-k  J  3  L 

J  o 


N-2 


%  •  arg  min  tr{  I  *.  K.\  V  (Q)  } 

L  *  e  p  j-k  2  3  J  1 


N-2 


0  -  arg  max  tr{  2  6,  K.,  W  (Q)  1^,} 
L  0  6V  i-k  J  L;3  J  1  J 


2  -  arg  max  tr{2  W  (Q) } 

oL  2  €  S  o 

o 


\K-f\ ^/k-l  <*  +  *U<>  1  \  -WK-1  <£  +  V'1 

2^  ■  A.  2L  A?  +  2 

k+l/k  Tc  k/k-1  \  Lk 


(4.35) 

(4.36) 

(4.37) 

(4.38) 


,-1, 


"  [  \  Sk/k-l  Ck  +  ^Lk  1  [Ck  2k/k-l  Ck  +  ®Lk ]  [\  "k/k-1  Ck  +  \kJ 

(4.39) 


2£  /k  -1  “  2oL 
o  o 


(4.40) 


Notice  that  while  the  optimal  observer  for  fixed  covariances  is  independent 
of  the  error  weighting  matrix  Q,  this  is  not  the  case  here,  since  the  least 
favorable  covariances  may  depend  on  Q.  Moreover,  the  least  favorable  covari¬ 
ances  (from  k-k  to  N -  1)  depend  on  the  final  time  N. 
o 


51 


Theorem  3 

Under  the  hypothesis  of  Theorem  2,  ^l»'*L,^L’“oL^  C^e  ■'•east 
favorable  collection  of  covariances  for  filter  state  estimation  if  and 
only  if  (4.38-40)  and  the  following  are  satisfied 


2  »  arg  max  tr{  2  P..,} 

L  Z  €  X  j-k  2  3 

J  0 

(4.41) 

N-2  _ 

V  m  arg  min  tr{  2  'P.  K.  .  P .  . .  ) 

L  ^  ^  p  i*lc  ^  ^ 

J  o 

(4.42) 

N-l  —  N-2 

9t  -  arg  max  tr{  2  ©  EL1.  Q  EL,  +  2  0 

L  ee  Si  j«ko  J  J  j-kQ  3 

^  Vl  V 

(4.43) 

2  .  »  arg  max  tr{2  P,  } 
oL  s  €S  °  k0 

o 

(4.44) 

Pj  -  W^ai-EjC]1  q  [  1-EjC]) 

(4.45) 

_  VL  _T  .  _  vh  «T  ,  n  1 

^k  *  “k/k-1  Ck  C  Ck  “k/k-1  Ck  ®LkJ 

(4.46) 

3.  Uncertainty  in  Means  and  Covariances 

Once  we  have  solved  the  problem  for  uncertainty  in  the  covariance 
matrices ,  we  drop  the  assumption  that  the  means  are  known  and  we  attack  the 
completely  general  question.  The  penalty  functions  for  predictor  and  filter 
state  estimation  are  derived  from  (4.5) »  (4.12),  (4.17) 


LCh.y.A)  -  i  (tr{Q.  If/k  .)  +  (y-yL)T(iK)T  *1  \  \dK)(y-yL) 
p  k«k 

o 


L  (h.y.A)  -  2  (tr(Qk  2*/k>  +  (y-yL)  Ffe  Qfe  Fk(y-yL)) 
z  k»k 

o 


(4.47) 


(4.48) 
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Ic  is  not  possible  to  apply  Lemmas  1  and  3  Co  these  penalty  functions 
because  they  are  not  concave  in  the  uncertainties  (in  fact,  they  are  convex) 
However,  we  can  put  the  uncertainty  set  as  the  cartesian  product  x  x  P  x  V  * 
S  x  M,  and  the  part  of  the  penalty  function  due  to  the  error  variance  can 
be  decomposed  in  the  same  way  as  in  Theorems  2  and  3  in  order  to  apply 
Lemma  2  (note  that  no  further  decomposition  of  the  penalty  due  to  the  means 
is  possible).  Unfortunately,  except  for  trivial  mean  uncertainty  classes, 
equation  (2.20)  has  no  solution  in  this  case.  The  reason  for  this  can  be 
seen  intuitively  considering  that  the  performance  of  a  filter  built  for  a 
given  set  of  means  can  only  be  deteriorated  when  different  means  are  truly 
present,  and  hence  no  saddle-point  solution  to  the  filtering  game  exists. 

Alternatively,  we  deal  with  the  soft  minimax  solution  to  the 
problem.  According  to  this  approach,  useful  to  model  frequent  situations 
in  which  there  is  not  "total"  uncertainty,  operating  points  that  are 
closer  to  a  given  nominal  are  more  likely  to  pccur  than  those  in  the 
uncertainty  class  more  distant  from  it.  This  farther  knowledge  can  be 
taken  advantage  of,  by  adding  to  the  penalty  function  an  additional  term 
accounting  for  the  distance  between  the  operating  point  and  the  nominal. 
Applying  this  to  the  uncertainty  in  means 

l/(h,y,A)  -  Lp(h,y,A)  -  (y-yN)T  D  (y  -  yN>  (4.49) 

where  D  is  a  positive  semidefinlte  matrix,  and  y^  is  the  vector  of  nominal 
means . 

Theorem  4 


Suppose  for  all  possible  covariances  we  have 


53 


N-l 

T* 

k"ko 


(IK)T  *k  Qk  VIK)  ‘  D  *  0 


(4.50) 


Chen  the  Kalman  filcer  given  by  (4.34-40)  and  with  nominal  means  is  robusc 
for  Che  game  (M  x  C,  X,  I/)  (prediccor  scace  escimacion). 


Proof 

Applying  Lemma  2,  we  see  chac  in  order  for  (2.20)  to  have  a  solution, 
Che  Kalman  gain  muse  be  che  opeimal  for  Che  lease  favorable  sec  of  covari¬ 
ances  (independenc  of  che  means)  and  Chus  (4.34-37)  muse  hold.  Besides  we 
need  a  solucion  Co 


y 


R 


arg  min  (y*(yR)-yL)TV  (IK)  Vq^dK)  (y*(yR)  -  yL) 
yL6M  K  k-kQ 

-  (y*(yR)-yH)T  0 


(4.51) 


N-l 

y*(yD)  -  arg  max  (y-y„)T  2  (IK)  Vq.  \(IK)  (y  -  yR) 
R  y  e  M  k-k 


-  (y-yH)  D  (y-yM) 


(4.52) 


BuC  taking  inco  accounc  chat. 


0  < 


N-l 

2 

k*k 


(rK)T<&kQA(IK> 


<  D 


(4.53) 


y*(y^)«y^,  and  therefore  y^  is  solution  to  (4.51).  An  analogous  proof  holds 
for  the  next  theorem. 


Theorem  5 

Suppose  for  all  possible  covariances  we  have 
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N-l 

k-k 

o 


Fk  «k  Fk  - D « 0 


(4.54) 


Chen  Che  Kalman  filcer  given  by  (4.38-46)  and  wich  nominal  means  is  robusc 

f 

for  Che  game  (M * C,JC,L^)  (filcer  scate  escimacion) . 

Ic  is  inceresting  Co  remark  chac  the  design  of  observers  for  systems 
wich  unknown  input  is  just  a  particular  case  of  the. unknown  means 
problem  treated  here.  Basar  and  Mintz  [20]  have  solved  a  tracking-evasion 
problem  through  the  design  of  a  filter-estimator  for  a  system  with  obser¬ 
vation  noise  and  initial  state  with  known  second  order  statistics  and 
unknown  input.  They  use  a  penalty  function  that  counterbalances  the  mean 
square  estimation  error  (in  one  inscanc)  with  a  quadratic  cosc  on  Che 
choice  of  Che  input.  In  our  notation  we  have:  for  k  ■  kQ, . . . ,N-2, 

unknown,  w^-0,  vfc»0,  ^-0,  Sfc-0,  Qk«0;  QJ?_1  i  0,  n^-0, 

12  12 

D  *  diag(DoofcDQ,D^)  where  are  matrices  of  appropriate  dimensions 

and  arbitrarily  large  positive  eigenvalues  (this  models  the  certainty 

on  m  and  v,  ) .  Then  the  sufficient  condition  of  Theorem  5  in  order  for 
o  k 

the  robust  estimator  to  have  w^^O*  reduces  to 

[  I-En_]_  Cn-1]  T[  *<N-l,k+l) :  -  i*(N-l,N-l)]  T 

QN_lt ♦(N-I,k<+1)  s  ~ :*(N-1,N-1)]  [  I-^i  CN-1]  -  Dq  <  0  (4.55) 

which  is  precisely  the  sufficient  condition  derived  in  [20  ]  in  order  for 
the  minimax  estimator  to  have  D^'O. 
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4.3  Robust  Regulator  Problem 
1.  Formulation 

The  optimal  regulator  for  linear  quadratic  optimal  control  is  the 
solution  to 


arg  min  J(H,y,A) 
H  €  X 


(4.56) 


with 


N-l 


J<H,y,A)  -  Eh?  F  k^l  +  I  Et«jQltxt+uJl52kOk|E1[.1]  (4.57) 


k-k 


where  x^is  the  state  of  the  system  (4.1-2)  with  zero  mean  process  noise, 

X  is  the  set  of  linear  filters,  with  input  ^  and 
and  F  and  positive  semidefinite  and  Qj.  positive  definite.  The  optimal 
filter  when  the  noise  covariances  are  fixed  is  [16]  : 


and  output  U^, 


N 


~Gk  *k/k-l 

[  Q2k +  Vi  \]  _1  Bk  Vi  s 

‘i  Sk+1  \  +  «lk  -  Sk+1  V^k  +  ^  Sk+1 


F 


(4.58) 

(4.59) 

(4.60) 

(4.61) 


and  x^^-i  the  optimal  predictor  estimate  given  by  (4.6),  (4.8),  (4.21), 
(4.22).  This  is  the  consequence  of  the  separation  principle  of  stochastic 
control,  that  enunciates  the  optimality  of  the  feedback  of  the  state  esti¬ 
mate  with  the  same  gain  as  that  in  the  known  state  case.  Since  the  feedback 
gain  G  does  not  depend  on  the  statistics  of  the  noise  it  should  be  expected, 
that  when  these  are  unknown,  the  optimal  value  of  G  is  unchanged,  and  thus 
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the  control  and  state  estimation  problems  can  still  be  solved  separately. 
We  will  prove  this  statement  rigorously  and  will  show  that  in  general  the 
state  estimates  are  not  given  by  a  minimax  observer. 

We  will  use  the  following  identity  [18]. 

Lemma  5 

If  the  process  noise  has  aero  mean 


EtxN  F  +  2  \  Qlk  xk  +  Uk  Q2k  UkJ 

k»k 

o 


mo  Sk  mo  +  trtsk  V  +  2  Cr{sk+1  V 

o  o  k*k 

o 


N-l 


,T,.T 


♦  k:k  +  °k  v  w  \  *  <w  (uk +  vi 

o 


(4.62) 


with  G  and  S  given  by  (4.59-61). 

2.  Uncertainty  in  Covariances 
Theorem  6 

The  regulator  consisting  of  the  feedback  for  complete  state  infor¬ 
mation,  of  the  state  estimates  produced  by  the  Kalman  filter  for  the  least 
favorable  collection  of  covariances  for  linear  quadratic  optimal  control 
is  robust  for  the  game  (C,3C,J). 

Proof 

Similarly  to  the  proof  of  Theorem  1,  we  first  consider  the  class  5C* 
of  linear  filters  of  Che  form  (4.6),  (4.58-61)  for  KS  mn*r  and  with 
matrices  A,  B,  C  given  by  the  original  system.  Let  us  define  the  positive 
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Again  this  is  linear  in  2,  9,  2q,  and  thus  concave  in  C'.  Besides 

Lemma  4  implies  that  every  element  in  L'  forms  a  regular  pair  —  for 
(C',Jf*,J)  —  with  its  optimum  regulator.  The  optimal  linear  regulator 
(over  3Q  for  any  set  of  covariances  belongs  to  the  class  3C  ,  so  we  have 
(2.7)  for  all  A  €  C,  and  the  theorem  follows. 

Note  that,  as  the  following  theorem  states,  the  least  favorable  set 
of  covariances  for  this  problem  may  be  different  from  the  one  for  state 
estimation,  hence  our  previous  assertion  that  the  Kalman  filter  for  the 
minimax  regulator  is  not  a  minimax  observer,  in  general.  By  the  same  kind 
of  manipulations  that  led  to  (4.32),  the  penalty  function  can  be  expressed 


J(H,A)  -  J  S  »0  +  tr(S  Z  +  S„  X  ) 

O  0 


+  «<:o  \  m  *  i  v  +  Mi  w)+l<,,)), 

J  n 


(4.65) 


Leona  3  can  be  applied  to  this  expression  resulting  in 
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Theorem  7 

Under  Che  same  assumptions  and  definitions  as  in  Theorem  2, 

(2.  .'I'  ,©  ,2  )  is  the  least  favorable  collection  of  covariances  for 

ILL  01 

linear  quadratic  optimal  control  if  and  only  if  (4.38-40)  and  the  following 
are  satisfied. 


L 


0 


L 


“oL 


“*  tTv  "l3S-l  r  +  *  SJ(Si+l  +  Vl<8»> 

~  x  J  o 


arg  min  tr{  2  ♦  K?  W  (N) } 

*  €  P  j-k  j  j‘t'1 

J  o 


im 

arg  max  tr{  2  8,  K.,  W  .  (N)  IC  > 

0  €  N  1-k  J  LJ  J  1 

o 

1  arg  max  tr{2Q(Sk  +  ^  (N) ) } 

2  €  S  o  o 

o 


(4.66) 

(4.67) 

(4.68) 

(4.69) 


3.  Uncertainty  in  Means  and  Covariances 

When  the  means  of  the  initial  state  and  of  the  observation  noise  are 
unknown,  the  penalty  function  is,  using  soft  minimax. 


J(h,y,A)  -  tr(Sk  Io>  +  S  tr{Sk+1  *k>  +  I  1^) 


+  (V-VL)T(JK)T  i  »k  (JK)(V-VL)  +  VT(IS)V 
^  ^0 


-  (V-VN)lD(V-VN) 


(4.70) 


where  (JK)  -  diagU^-K^ . -K^h  (IS)  *  dia*{Sk  ,0(N-l-k  )r}’ 

o  *  o  o' 

V  •  col(mo,vk  (analogously  and  then  we  have  straightforwardly 

o 

from  Leona  2,  the  following 
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t 


i 


Theorem  8 

The  regulator  consisting  of  the  feedback  for  complete  state  information 
of  the  state  estimates  produced  by  the  Kalman  filter  given  by  (4.38-40)  and 
(4.66-69)  and  with  means  V  that  solve 


V, 


R 


*  .  T  T 

(V  .^'(JK)1 


N-l 

2  *1 
k-k  * 


W(JK)(V  -VL) 


(4.71) 


V*  -  max  (V-Vr)T(JK)T  2  Nk  \(JK)  (V "  V 
V  €  M  k*k 

o 

+  VT(IS)V  -  (V-  vN)  D  (V-  y 


(4.72) 


is  robust  for  the  game  (MxC,3f,J). 

4.4  Continuous-time  case 

Consider  a  system  described  by  the  stochastic  differential  equations 

dxt  *  A(t)  xfc  dt  +  B(t)  ut  dt  +  d?  t  (4.73) 

dyt  *  C(t)  xt  dt  +  d0.  (4.74) 

with  x£  an  n*l  state  vector,  ut  and  m^l  control  vector,  yfc  and  r*l  output 
vector,  and  ?t>  0  Wiener  processes  with  incremental  covariance 

dt  (4.75) 

and  with  the  initial  state  x(tQ)  a  gaussian  random  vector  with  mean  xq  and 

variance  2  uncorrelated  with  ? .  and  0  . 

o  c  c 

We  will  suppose  here  that  the  means  are  known  and  that  the  covariance 
matrices  belong  to  an  uncertainty  set  (*(t)  .'J'(t)  ,6  (t)  ,2q)  *  A  6  C.  It  is 


cov  [d  ,  d  flj.]  ■ 


2(t) 

T 

*  (t) 


*(t) 

©(t) 


60 


well  known  that  the  optimal  observer  of  the  state,  i.e.  the  system  that 
minimizes 


L(H,A) 


E[esQ(s)es|u<s,y<s]  ds 


(4.76) 


where  et  *  x£  -  xt  and  Q(t)  is  a  positive  definite  weighting  matrix  in  the 
Kalman  filter,  is  given  by 

dx£  *  A(t)xtdt  +  B(t)utdt  +  K(t) [dy^  -  C(t)xtdt]  (4.77) 

K(t)  *  [P(t)CT(t)  +  *(t)]0-1(t)  (4.78) 

P(t)  -  [A(t)  -  'P(t)Q_1(t)C(t)]  P(t)  +  P(t)[A(t)  -  *(t)©-1(t)C(t)lT 

-  P(t)CT(t)0_1(t)C(t)P(t)  +  Z(t)  -  'Kt)©~1(t)*T(t)  (4.79) 

P(t  )  =  2  ,  x  =  x  (4.80) 

o  o  C  o 


Moreover,  the  optimal  regulator  for  linear  quadratic  optimal  control  is  the 
system  that  minimizes 


J (H,A)  -  E[x^Fxt|y<t]  + 


(xgQi(s)xs 


+  ugQ2(s)ugI  ds 


(4.81) 


is  given  by 


u(t)  =  -G(t)x(t)  (4.82) 
G(t)  -  -Q21(t)BT(t)S(t)  (4.83) 
-S(t)  -  AT(t)S(t)  +  S(t)A(t)  +  Qx(t)  -  S(t)B(t)Q21(t)B(t)S(t)  (4.84) 
S(tx)  -  F  (4.85) 
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and  x(t)  given  by  (4.77-80). 

Then,  all  the  results  in  discrete-time  have  their  respective  parallels 
in  continuous  time: 


Le™n«  6 

Suppose  C'  is  a  convex 

\  ’  (aL*'*L»eL*SoL)  and  “ 
Let  e^  be  the  error  of 
-  -T 

point  and  let  *  Eletet]  . 


set  and  (S.'fr,©,!  )  »  A  *  (1  -  a)A  +  aSl  with 

O  L 

(V,X,Y,Z)  both  belonging  to  C’. 

the  optimal  observer  for  A  when  A  is  the  operating 

L 

Then 


lim  -  [P  -  R  ]  -  0  (4.86) 

-at  t 
a  -*■  0 


Proof 

By  similar  manipulations  we  can  get  the  counterpart  of  equation  (4.26): 

(P-R)  *  (A-KlC)(P-R)  +  (P-R)(A-KlC)T  +  (K-Kl)6(K-Kl)T  (4.87) 

Analogously  to  Appendix  1  we  can  see  that  (K  -  K^)  =  0(a),  and  when  a  tends  to 

0,  all  the  coefficients  of  the  Ricati  equation  (4.79)  are  perturbed  linearly. 

Moreover,  since  the  initial  condition  P(t  )  -  R(t  )  =  0,  the  solution  of 

oo 

(4.87)  is  o(a). 

Theorem  9 

The  Kalman  filter  for  the  least  favorable  collection  of  covariances  for 
continuous- time  state  estimation  is  robust  for  the  game  (C,3C, L) . 

Proof 

Using  Lemma  6  the  proof  is  completely  analogous  to  the  one  in  Theorem  1, 


noting  that 
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L(H,A) 


tr{Qtpt>dt 


(4.88) 


For  linear  quadratic  optimal  control,  the  following  Lemma  [18]  is  used. 

Lemma  7 


E  [X  F  X  + 
C1  C1 


[Xt  Q1  Xt  +  Ut  Q2  V  dtl  * 


=  X  5(t  )X  +  tr{S(t  )£  }  + 

o  O  O  0  0 


tr{iS}dt 


+  E[ 


(Ut  +  G  Xt)  Q2(Ut  +  G  Xt)dt] 


(4.89) 


where  G  and  S  are  given  by  (4.83-85). 

Theorem  10 

The  regulator  consisting  of  the  feedback  for  complete  state  information, 
of  the  state  estimates  produce  ’  by  the  continuous-time  Kalman  filter  for  the 
least  favorable  collection  of  covariances  for  linear  quadratic  optimal  control 
is  robust  for  the  game  (C,X,J). 


Proof 

Again,  the  penalty  function  is 


J (H,  )  =  X  S( t  )X  +  tr{S(t  )2  }  + 
o  o  o  oo 


tr{SS}dt 


tr{G  Q2  G  P}dt 


(4.90) 
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and  Lemma  6  can  be  applied  to  yield  the  desired  result  by  the  same  reasoning 
as  in  the  proof  of  Theorem  6. 

Identically  to  (4.31)  we  define  in  the  continuous-time  case: 


VR) 


(t,X) 


Rt  4>l  (t,A)  dt 


(4.91) 


Then  the  next  two  theorems  are  proved  similarly  to  Theorems  2  and  7. 

Theorem  11 

If  the  covariance  uncertainty  classes  are  independent  then  (2  ) 

is  the  least  favorable  collection  of  covariances  for  continuous-time  state 
estimation  if  and  only  if  the  following  are  satisfied 


=  arg  max  tr  ]  2(t)W  (Q)dt 

2  €  X  Uti  c 


*  arg  min  tr 
L  *  €  P 


©T  =  arg  max  tr 
L  ©  €  N 


2  *  arg  max  tr{2  W  (Q)} 

0L  2  €  S  o 

o 


\  ‘  IpLcT  +  \l@[1 


f  0  I 

■n:)KuV:q>dt  } 

1 

t0  I  1 

9<t>KuV)KLt'i';  | 

^  1 


PL  ’  LA~*ieilcl  Fl  +  PLlA-*LeLlclT  '  PLCI®L1C  PL  +  2L  -  ♦iSX 


W  •  £0L 


(4.92) 

(4.93) 

(4.94) 

(4.95) 

(4.96) 

(4.97) 


(4.98) 


Theorem  12 


If  the  covariance  uncertainty  classes  are  independent  then  (Z  ,0^fZoL) 
is  the  least  favorable  collection  of  covariances  for  linear  quadratic  optimal 
control  if  and  only  if  (4.96-98)  and  the  following  are  satisfied. 


arg  max  tr 
Z  6  X 


(J 


Z(t)  [S(t>  +Wt(N)]dt 


=  arg  min  tr 
L  *  e  p 


*(t)KLtMt(N)dt 


0  *  arg  max  tr 

L  0  € 


t1 

:  tr{j  ect^OO^dt 


2  =  arg  max  tr{2  (S(t  )  +  W  (N) } 

°L  £  €  5  o  o  tQ 


N  *  G  Q2  G 


(4.99) 

(4.100) 

(4.101) 

(4.102) 

(4.103) 


Steady  state  case 

Consider  the  case  In  which  the  system  and  covariance  matrices  are  time 
invariant.  Under  the  proper  assumptions,  S(t)  and  P^(t)  given  by  the  Ricatti 
equations  (4.84)  and  (4.97)  reach  a  steady-state  solution.  If  we  consider  the 
case  t1  *  T  -*■  ao,  the  penalty  function  should  be  converted  to  lim  ^  J(T),  in 
which  case,  (4.99-101)  result  in 


Z  *  arg  max  tr{Z(s  +  W(N))}  (4.104) 

L  Z  6  X 

S'  *  arg  min  trf'P  kJ  W(N)}  (4.105) 

L  V  <=  P  L 

0T  ■  arg  max  tr{K_  @T  K?  W(N)} 

u  Q  e  u  L  L  L 


(4.106) 
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but,  from  (4.91)  we  can  see  that  W(N)  is  the  solution  to  the  algebraic 
Riccati  equation 

(A  -  1^C)TW(N)  +  W(N)  (A  -  KjC)  +  N  «  0  (4.107) 

Therefore,  (4.104)  and  (4.106)  are  precisely  the  equations  given  in  [11]  for 
the  uncorrelated  steady-state  case.  An  analogous  discussion  with  Theorem  11 
results  in  the  conditions  of  [10]  for  the  state  estimation  problem. 

4.5  Application  of  the  Conditions  for  Least  Favorability 

The  recursive  character  of  the  sets  of  equations  for  least  favorability 
given  in  Sections  2  and  3  makes  them  attractive  for  a  numerical  solution 
based  on  an  iterative  search  of  least  favorable  covariances  and  filter 
matrices.  However,  for  some  uncertainty  classes  further  analytical  results 
are  possible,  as  we  show  in  this  section.  In  order  to  illustrate  the  use 
of  the  conditions  for  least  favorability  we  will  consider  several  cases: 

i)  If  the  uncertainty  classes  do  not  impose  a  time  dependence  upon  their 

elements  (i.e.,  they  constrain  only  the  instantaneous  values  of  the  matrices), 

the  least  favorability  conditions  result  in  the  extremization  of  the  "weighted 
T 

trace",  ^^(U^  T^) ,  of  the  unknown  matrix  U  for  every  sample,  where  T  is  given 
by  every  particular  condition. 

a)  When  the  autocovariance  classes  x ,  N  or  S  have  an  element  that 
is  maximal  for  every  time  k,  i.e.  is  positive  semidefinite  for 

every  element  in  the  class,  for  all  k,  this  is  the  least  favorable 
(cf.  [12],{l4]).  This  follows  because  in  the  corresponding  conditions  every 
unknown  autocovariance  matrix  appears  multiplying,  under  the  trace,  a 
positive  semidefinite  matrix. 


b)  Suppose  all  is  known  of  a  particular  covariance  matrix  U^,  is 
that  its  elements  lie  between  some  independent  bounds 


Ufcd.j)  <  \(i.j)  <  Uk(i,j) 


(4.108) 


preserving  the  definiteness  of  the  matrix  if  it  belongs  to  an  autocovariance 
class.  Obviously  this  class  is  convex  and  the  least-favorability  conditions 


can  be  used. 

We 

get  easily 

that  the 

least  favorable  matrix 

uk(i,j) 

if 

Tk(i,j)  <  0 

oLk(i,J) 

*  H 

4(i,j) 

if 

Tk(i,j>  >  0 

(4.109) 


In  particular,  if  Tk  is  positive  definite  the  diagonal  elements  of  U^k 
have  their  largest  possible  value,  a  result  that  was  achieved  employing 
different  techniques  in  [13]  for  observer  design  with  uncertainties  only  in 
the  autocovariances  of  the  process  and  observation  noises. 


ii)  Consider  the  following  model  'or  the  deviation  of  the  time  varying 
matrices  from  a  given  nominal: 


({VE*  ’  \«  B*Xb.«<J-0(O,ll  <a) 


(4.110) 


with  the  possible  preservation  of  definiteness,  as  before.  The  norm  of 

(4.110)  is  the  root  mean  square  of  the  Euclidean  norm,  i.e. 


,  M  a  b 

HUH"  -  2  2  2  (U.(i,j)) 

k-k  i-1  j-1 
o 


(4.111) 


and  it  is  easy  to  check  chat  coincides  with  the  norm  associated  with  the 


6 


inner  produce 

M 

<  A,B>  ■  tr{  2  ^  B^}  (4.112) 

k»k 

o 

where  A^,  S  1R  . 

Note  Chat  this  definition  of  the  norm  for  matrices  of  arbitrary  frame 
allows  the  accomodation  of  uncertainties  in  the  crosscovariance  ♦.  Convexity 
of  the  set  Lj  follows  directly  from  the  Schwarz  inequality,  thus  it  is 
possible  to  apply  the  conditions  for  least  favorability,  that  can  be  expressed 

in  general  as 

U  -  arg  ext  <  U,T>  (4.113) 

U  €  y 

where  ext  is  max/min  for  auto/cross-covariances  respectively.  The  solution 
of  (4.113)  is  (Proposition  1,  Chapter  3) 

UL  -  U(o)  +  cT  (4.114) 

where  e  is  a  nonnegative  scalar  such  that  is  an  extremal  element  of  y, 
and  should  be  computed  recursively.  For  example,  suppose  that  the  cross* 
covariance  of  the  process  and  observation  noise  is  uncertain  around  an 
uncorrelated  nominal  model.  Then  for  the  cases  exposed  in  Sections  2  and 
3,  the  least  favorable  conditions  are  (4.35),  (4.42),  (4.67),  with  the 
common  form 

N-2 

»  arg  min  tr{  2  * .  K. ,  W  }  (4.115) 

L  *  €  P  j«k  3  3  l 

J  o 


if  the  uncertainty  class  (4.110)  is  used,  the  least  favorable  is 
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*Lj  -  -£  Vi  hi  (4,116) 

Substituting  this  into  (4.38)  the  Kalman  gain  for  the  robust  filter  is 

-  A  2L  CT  [  C  ZL  CT  +  0  +  e  WI]  _1  (4.117) 

-  t  (C  2L  CT  +  0)  1  +  e  wf 1  A  ZL  CT  (4.118) 

when  the  dimension  of  the  system  (n)  and  observation  (r)  is  one  respectively. 

In  order  to  treat  uncertainties  in  the  covariance  of  the  initial  condi¬ 
tion,  or  when  the  other  unknown  matrices  are  restricted  to  be  constant,  we 

can  use  (4.110)  as  the  deviation  class  with  the  additional  assumption  that 
U  is  time  invariant  and  the  norm  (4.111)  is  not  summed  over  time.  Then  we 
define  the  inner  product  as 

<A,B>  *  tr{A  BT)  (4.119) 

and  the  least  favorable  has  the  form 

,  x  M 

UT  *  IT0-'  +  e  £  T  (4.120) 

L  k-k  K 

o 

if  dealing  with  2,  ♦or  0,  and  (4.114)  for 
4.6  Conclusions 

The  application  of  the  general  formulation  of  minimax  robust  filtering 
has  allowed  us  to  present  minimax  theorems  for  predictor  and  filter  state 
estimation  and  for  quadratic  control,  under  general  classes  of  uncertain¬ 
ties  in  the  second  order  statistics  of  the  linear  stochastic  system.  These 
results  represent  a  generalization  of  earlier  works  [1],(2]  devoted  to  the 
steady-state  case  for  invariant  continuous-time  systems  with  uncorrelated 
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noises.  The  minimax  theorems  state  that  the  minimax  filter  is  the  optimal 
for  the  least  favorable  collection  of  noise  covariances;  this  saddle  point 
property  implies  a  very  attractive  feature,  namely,  that  when  the  actual 
covariances  differ  from  the  least  favorable  ones,  the  performance  of  the 
filter  is  upgraded.  Sets  of  necessary  and  sufficient  conditions  are  given 
for  the  least  favorability  of  the  four  types  of  covariance  matrices  Involved. 
These  conditions  lend  themselves  to  a  recursive  solution,  and  are  applied 
successfully  .to  deviation  classes  of  practical  interest.  It  is  worth  to 
underscore  that  under  the  same  type  of  uncertainty  the  worst  case  covari¬ 
ances  do  not  necessarily  coincide  for  the  three  types  of  filters  treated 
here,  in  particular  the  observer  used  for  the  minimax  regulator  is  not  in 
general,  a  minimax  predictor  state  estimator. 

Given  a  particular  expected  deviation  behavior,  the  tradeoff  between 
the  decrease  of  performance  (with  respect  to  the  nominal  filter)  in  the 
nominal  model  and  the  improvement  of  the  worst  case,  should  be  assessed 
for  the  practical  application  of  the  minimax  filters  presented.  The  soft 
minimax  approach  has  been  used  to  deal  with  unknown  means,  and  the  con¬ 
nection  with  earlier  works  for  estimators  with  unknown  forcing  functions, 
has  been  shown.  If  the  covariance  uncertainty  classes  can  be  modeled, 
as  well,  including  nominals  that  are  more  likely  to  occur,  their  soft 
minimax  solution  is  straightforward  from  the  previous  results. 
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Appendix  1 

Proof  of  the  existence  of  the  linear  term  of  the  Taylor  series  of  K 
in  powers  of  a. 

Denoting  5*  ■  lim  —  (tr(a)  -  w  ( 0) ) ,  differentiating  (3.17)  and  (3.18) 
a  o 

6K*  -  ([A  52k  CT  +  X]  -  [  A  CT  +  *  ]  R_1  [C  8^  CT  +  Y])R_1  (A.  1.1) 

«Sk+1  .  *  »Sk  AT  -  [  A  C1  +  X]  R*1  [  A  I,  C1  ♦  *,1 1 

+  [  A  SL  CT  +  'PJ  R*1  [  C  5Sk  CT  +  Y  ]  R"1  [  A  SL  CT  +  T 

-  1  A  SL  CT  +  *L]  R"1  [  C  52k  AT  +  XT]  +  V  (A.1.2) 

T 

with  52k  -  Z  and  R  *  C  2^  C  +  ©L>  but  since  ©L  is  positive  definite  and 

o 

is  positive  semidefinite,  R  is  nonsingular.  Thus  5k  exists  for  every 
time  k,  and  all  €  C'  . 
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For  Che  case  of  uncorrelated  and  time  invariant  noises  with  a  priori 
known  covariance  of  initial  state  and  first  order  statistics,  a  proof  of 
the  minimax  theorem  for  Kalman  gain  design  in  continuous  time,  has 
appeared  previously  in  the  literature  [19].  The  authors  show  the  con¬ 
cavity  in  the  uncertainties  and  the  convexity  in  the  gain  matrix,  of  the  - 
payoff  function,  and  therefore  are  able  to  apply  a  standard  game  theoretic 
theorem.  Unfortunately  the  payoff  function  is  not  convex  in  the  Kalman 
gain  as  can  be  illustrated  by  a  simple  one-dimensional  continuous  time 
example: 


x  ■  -x 
y  »  x  +  w 


where  the  initial  state  at  t  ■  0  is  known,  and  the  variance  of  the  white 
observation  noise  w  is  assumed  to  be  9.  The  mean-square  error  of  an 
observer  with  constant  gain  K,  at  time  t  *  ^  is  easily  obtained 

E  [e2(y)  ]  -  |e  (K2/  (K  +  1)  )  (1  -  exp(-K  -  1)) 


a  function  that  is  not  coavex  in  K.  (Obviously  the  optimal  K  is  zero, 
since  no  observations  are  needed  in  order  to  estimate  the  state  without 
error. ) 

The  mistake  in  the  proof  of  the  convexity,  is  committed  by  neglecting 
Che  dependence  of  the  error  covariance  on  the  gain  K  when  taking  the 
derivative  of  a  Riccati  equation  with  respect  to  K. 
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5 .  OTHER  APPLICATIONS 


5.1  Wiener  Filtering 

Wiener  filtering  is  one  of  the  most  developed  applications  of  minimax 
robust  filtering  to  date.  The  precursor  work  of  Kassam  and  Lim  [26]  was 
followed  by  the  formalization  of  Poor  (27]  and  the  study  of  the  discrete¬ 
time  causal  Wiener  filtering  problem  by  Vastola  and  Poor  [28]  .  The  purpose 
of  this  section  is  to  illustrate  the  application  of  the  results  of  Chapter 
2  to  the  noncausal  uncorrelated  case  (the  general  situation  can  be  handled 
in  a  similar  fashion).  While  the  application  of  the  robust  filtering  theorem 
leads  to  a  slight  generalization  (the  signal  and  noise  uncertainties  are  not 
required  to  be  independent)  of  Theorem  1  of  [27]  ,  the  uncertainties  separa¬ 
tion  theorem  results  in  a  set  of  conditions  for  least-favorability  similar 
to  those  given  in  Chapters  3  and  4.  (Note  incidentally  that  in  the  area  of 
robust  Wiener  filtering  an  atypical  definition  of  least-favorability  has  been 
used.  This  confusion  has  its  origin  in  an  obscure  discussion  in  [31]  .)  Let 
us  consider  the  penalty  function 


E(H;s,n) 


(°° 

[s(w)  |K(w)  -  H(w) 

_oo 


+  n  (w)  ]  H  (w)  |  ]  do; 


(5.1) 


where  H  £  X,  (s,n)  €  P  C  F  x F  with  X  and  F  representing  some  adequate  spaces 
of  filter  transfer  functions  and  power  spectral  densities,  respectively. 

Theorem  5.1 

Suppose  P  is  convex,  then  if  (s^,n^)  is  a  least  favorable  for  (P,X,E) 

s.  (w) 

V">  •  <5'2> 

is  a  minimax  robust  filter  for  the  game  (P,X,E). 
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Proof 


Note  that  the  penalty  function  (5.1)  is  linear  (and  therefore  concave) 
in  the  uncertainties.  (Moreover,  it  is  convex  in  the  set  of  filters,  thus 
standard  minimax  theorems  could  be  used  in  this  proof.)  So  it  is  sufficient 
to  prove  that  (H^^ 
via  the  robust  filtering  theorem. 

Let  (a,v)  *  (l-a)(sL,n^)  +  a(s,n),  where  (s,n)  S  P.  Then 

E  (a,v)  -  E(H^;o ,v)  - 


sL«n^))  is  a  regular  pair  in  order  to  prove  the  result 


g  (cj)  y  (io) 
a  (ui)  +  v  (w) 


-  a  (o>)  |K(w)  -  H^w)  | 2  +  v(w)  Ih^O-j)  | 2  dw 


|k<«) 


g  (eg)  v  (to) 


g  (cj)  +  v  (w) 


r  nL^  ~j2  r  sl(w)  “I 

°  ^  L?l  ^  +  “l  (w)J  V  ^  L?T  ^  +  nT 


sL(u>) 
L'J)  +  IV 


dw 


■r- 


[a  (w)nL  (<j)  -  v  (co)  sL  (cj)] 

[i^  («)  +sL(w)]  2(a  (w)  +v(w)) 


|K(w)  |  dcj 


00  2  [s(w)nL(w)  -  n(w)sL(w)] 

“  a  - 2 - 

-«  [nL(w)  +sL(w)]  (g(cg)+v(w)) 


|K(w) 


do; 


(5.3) 


Hence,  for  every  (s,n)  €  p 


lim  i  [E*(o,v)  -  E (H.  ;a,v)]  -  0 
a  +  0  a  L 


Theorem  5.2 

Suppose  the  uncertainties  in  signal  and  noise  and  independent,  i.e. 
P  -  $  x  N. 

((sL,nL),H^)  is  a  saddle  point  of  (S*N,JC, E)  if  and  only  if 
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SL(W) 


2. 


3. 


VW)  “  K(W)  ST'(J)  +  n  (co)  * 

L  L 

sT  *  argmax  s  («)  |  K(w)  -  E.  (w)  |  2  dw  , 
L  S6S  •’-»  X 

nT  •  argmax  n(w)  j H_  (<*j)  |  ^  dto  . 

L  n6N  X 


Proof 


The  payoff  function  can  be  written  as 


-E(H;s,n)  =  f^H.s)  +  f2(H,n)  , 


j^CH.s)  -  -J” 

f. 


s  (co)  |  K(CJ)  -  H(w)  I  dcj  , 


f2(H,n) 


(w)  |H(w)  I  , 


(5.2) 


(5.4) 


(5.5) 


and  therefore  the  uncertainties  separation  theorem  (Thm.  2.2)  can  be  used. 

Note  that  (5.5)  is  a  special  case  of  (3.10)  and  therefore  the  previously 
derived  results  for  matched  filtering  with  noise  uncertainty  are  useful  here. 
Apropos  of  signal  uncertainty  the  following  result  can  be  proved  similarly 
to  Prop.  3.1. 

Proposition 

The  least  favorable  signal  power  spectral  density  in  the  class 

S  -  js(w)  S  F:  P  |s0(u)  -  s(w)  |2  dw<A2j  (5.6) 


is  given  by 
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sL(w)  =  s Q(to)  +  5  |K(o>)  -  H^w)!' 


where 


5  =  A/[ 


|K(w)  -  H^w)  |4  dw  ) 


(5.7) 

(5.8) 


At  this  point,  the  reader  should  not  be  confident  on  a  theory  that  leads 
to  a  result  such  as  the  proposition  above.  Suppose  that  the  power  spectral 
density  of  the  noise  is  fixed  and  is  bounded  away  from  zero.  Let  t|  be  a 
constant  such  that 


sup 

u6l 


0 


s0(u>)  +  5  jK(w)  -  ^(w) 


<  V  <  1 


Then,  straightforward  manipulations  show  that  Vs  €  S.  How  is  it  possible 
that  an  attenuated  version  of  s^  is  more  favorable  than  s^  itself?  Indeed, 
the  Wiener  filter  for  s^  gives  larger  mean  square  error  than  the  Wiener 
filter  for  i?s The  clue  of  the  question  is  that  we  are  not  using  the 
penalty  function  that  we  should;  in  practice  it  is  more  meaningful,  in 
general,  to  minimize  the  MSE  relative  to  the  signal  power  than  the  MSE 
itself.  Of  course  in  the  original  Wiener  filtering  problem  (with  fixed 
input  PSD's)  both  criteria  coincide;  however,  in  robust  Wiener  filtering 
a  different  penalty  function  should  be  used,  e.g.  the  MSE  relative  to  the 
input  signal  power. 


L(H;s,n) 


!°° 

_oo 


[s(w)  |K(«)  -  H(w)  |  ^  +  n(u))  | H(w)  |  2]  dw 


dco 


(5.9) 


Note  that  when  the  signal  uncertainty  class  is  such  that  all  its  members 
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have  the  same  power  (like  those  in  [26]  and  [27]),  this  is  equivalent  to 
(5.1).  However,  none  of  the  results  developed  in  Chapter  2  can  be  applied 
to  this  function,  on  one  hand  it  is  not  concave  in  the  signal,  and  on  the 
other  it  cannot  be  put  in  the  form  required  by  the  uncertainties  separa¬ 
tion  theorem.  Therefore,  robust  Wiener  filtering  in  this  more  general 
setting  remains  an  interesting  open  problem. 


5.2  Quadratic  Receivers 

The  optimum  quadratic  receiver  for  the  detection  of  a  stochastic 
signal  imbedded  in  additive  noise,  maximizes  the  deflection  of  the  quadratic 
test  statistic  <s+n,  H(s+n)>;  if  the  noise  is  gaussian,  the  deflection  can 
be  expressed  as  [23] 


tr2  {H  K  } 

D(H;Ks,Kn)  =  tr  (hk  HK  } 
n  n 


(5.10) 


where  Kg  and  are  the  signal  and  noise  autocorrelation  operators.  The 
optimal  filter  is 

H*(K  ,K  )  *  K-1  K  K_1  (5.11) 

v  s  n  n  s  n 

achieving  a  maximum  deflection 

d*<w  ■ tr  <KS  C K,  C1  l5-12) 

Several  generalized  signal-to-noise  ratios  have  been  used  besides 


the  deflection  [23]  .  In  [24]  it  was  given  a  generalization  of  (5.10)  — 
involving  the  variance  of  the  test  under  the  signal  present  hypothesis  — 
that  allows  an  analytic  solution  for  the  optimal  filter,  namely 
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tr  {HK  } 

s  _ 

tr{a„  HK  HK  +  2a„a,  HK  HK  +  a2  HK  HK  } 
Onn  Olns  1  ss 


(5.13) 


H*(K  ,K  )  *  (a.K  +  a, K  )-1  K  (a.K  +  a,K  )-1 
s'  n  'On  1  s'  s  0  n  Is 


In  order  to  apply  the  robust  filtering  theorem  to  these  instances,  we  need 

to  check  the  convexity  of  the  payoff  functions  in  the  possible  uncertainties. 

It  is  easy  to  see  (by  counterexample)  that  neither  (5.10)  nor  (5.13)  are 

convex  in  the  noise  operator  for  arbitrary  Kg.  And  while  (5.13)  is  not 

convex  in  Kg  for  arbitrary  a^,  a^,  K^,  (it  can  be  shown  taking  derivatives 

in  the  one-dimensional  case),  (5.10)  is  convex  in  K  : 

s 

tr2(H[(l-a)Kgl+aKs2l)  <  (1  -  a)tr2(HKgl)  +  atr2(HKs2) 


where  the  inequality  follows  from  the  linearity  of  the  trace  and  the  con- 
2 

vexity  of  f(x)  =  x  .  Therefore  the  robust  filtering  theorem  can  only  be 
applied  to  the  maximum-deflection  quadratic  receiver  for  fixed  noise 
autocorrelation. 


Theorem  5.3 

Suppose  the  signal  autocorrelation  uncertainty  set  S  is  convex,  and 
K^  is  the  least  favorable  signal  autocorrelation,  then  H^  *  K^  K^  K^2 
is  a  robust  filter  for  (3C,S,D). 


Proof 


All  we  need  to  prove  is  that  (H^K^)  is  a  regular  pair.  Let 


(1  -  a)KT  +  aK,  K  €  S 

L 
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D*(Ks,Kn)  -  DCH^K^)  - 

tttK;1  Ka  r1  Ks)  trtl^  y  -  tr2(Hi  Kj) 

V 

■  THSfSjT  [tr(HL  V1'1'"’2  “l  'C  V 

+  2a(l  -  a)  trU"1  K.  K"1  K}  +  a2  tr{K-1  K  K_1  K}] 
n  l  n  n  n 

-  ( (1  -  a)  tr{HL  K^}  +  a  trO^  K})^ 

2 

- 2 -  [tr{K-1  K  K_1  K}  tr{H.  K.  }  -  tr2{H_  K}]  (5.14) 

tr{HL  1^}  n  n  X  X  X 

therefore  (H^.K^)  is  indeed  a  regular  pair  for  every  €  S. 


When  there  are  uncertainties  in  the  noise  autocorrelation  the  problem 
can  still  be  solved  via  the  following  direct  application  of  the  uncertainties 
separation  theorem. 


Theorem  5.4 


Suppose  the  uncertainties  in  the  signal  and  noise  autocorrelation 
operators  are  independent.  (H^, (KsL»KnL))  is  a  saddle  point  of  (3C,S*N,D) 
if  and  only  if  the  following  conditions  hold: 


h  ‘  \,l  hi  • 


2.  K  T  -  min  tr{IL  K  } 

SL  K  £5  ^  s 

s 


K  .  *  max  tr{H.  K  HT  K  } 
nij  ^  m  L  n  L  n 

n 


(5.15) 

(5.16) 


3. 


(5.17) 
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Note  that  condition  2  has  the  same  form  that  the  least  favorability  condi¬ 
tions  did  in  Chapter  4.  In  particular,  the  results  in  Section  4.5  hold 
for  the  signal  uncertainty  here  with  minor  modifications.  Note  also  that 
the  uncertainties  separation  theorem  cannot  be  applied  to  the  generalized 
criterion  (5.13),  providing  another  example  in  which  neither  of  the  theorems 
in  Chapter  2  can  be  used. 


5.3  Output-Energy  Filter 

When  the  received  signal  is  random,  sometimes  a  linear  receiver  that 
maximizes  the  signal  to  noise  ratio  is  used  [25]  .  In  this  case  the  output 
signal-to-noise  ratio  is  given  by: 


R(h;Ks,Kn) 


Energy  of  output  due  to  signal 
Energy  of  output  due  to  noise 


<  h,  K  h> 

s 

<  h,  K  h)  ’ 

n 


(5.18) 


where  as  before  K  and  K  are  the  signal  and  noise  autocorrelation  operators. 

S  XI 

Further  justification  of  this  criterion  of  optimality  is  possible  considering 
the  SNR  of  (3.1)  as  a  random  variable,  in  which  case  its  expected  value  coin¬ 
cides  with  (5.18).  Moreover,  consider  the  special  type  of  quadratic  receiver 
consisting  of  a  linear  filter  followed  by  a  square  law  device.  If  the  signal 
and  noise  are  jointly  gaussian  processes,  the  variance  of  the  output  of  the 
receiver  is  proportional  to  the  square  of  the  variance  of  the  output  of  the 
filter,  and  therefore  the  deflection  of  the  output  of  the  receiver  is  propor¬ 
tional  to  the  square  of  the  energy  ratio  (5.18). 

The  optimal  output-energy  filter  is 

•k  —1 

h  (Ks»Kn)  *  eigenvector  of  Kg  with  maximum  eigenvalue  (5.19) 


This  can  be  proved  analogously  to  Theorem  X.13  of  [29]  . 
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Claim 

R(h;Kg,Kn)  is 

1.  Convex  in  Kg  for  arbitrary  h  and  Kn; 

2.  Convex  in  for  arbitrary  h  and  Kg; 

3.  Not  convex  in  (Kg,Kn)  f°r  arbitrary  h. 

Proof 

The  first  assertion  is  evident  since  R(h,Ks,Kn)  is  linear  in  Kg. 

To  prove  the  second  we  use  the  method  of  the  proof  of  Lemma  6  of  [30]  .  Let 

.  <  h,  Kih  > 

K  =*  (l-a)K,  +  aK„  and  B  =  a  j -r — nr-r-  .  Then 
n  1  L  Ui,  Kh ) 


<  h,  K  h>  (h,  Kh> 
n  n 


a  >--■  +  (1  -  a)  ' 


<  h,  Kh) 
n 


<  h,  Kjh >  +  (h,  K, 


h  > 


<  a 


<h,  Kxh>  +  (1_a)  <h,  K2h) 


(5.20) 


where  the  inequality  follows  from  B  <  a  (K^,K2  are  non-negative  operators 
by  assumption) .  The  third  assertion  is  easily  verified  bj  a  one-dimensional 
counterexample.  (Let  (Ks,Kn)  *  *5(4,2)  +  *s(l,l).) 

Therefore  the  robust  filtering  theorem,  if  applicable,  will  treat 
uncertainties  either  in  the  signal  or  noise  operator  but  not  in  both. 
However,  a  general  result  of  the  kind  that  we  have  presented  for  all  the 
previous  filtering  situations  is  not  possible  here,  because  the  existence 
of  regular  pairs  for  given  operators  Kg^  and  KnL  will  depend  on  every 
particular  uncertainty  class.  As  an  example  consider  the  simple  discrete¬ 
time  case  in  which  K  *  1  and  we  have  some  class  5  of  diagonal  matrices. 


I 
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Then  any  matrix  with  distinct  diagonal  elements  forms  a  regular  pair  with 
its  maximum-eigenvalue  eigenvector;  however,  any  matrix  whose  maximum- 
eigenvalue  eigenspace  has  dimension  greater  than  unity  does  not  form  a 
regular  pair  with  any  vector.  In  order  to  see  this  let  K  *  diag{s. , . . . ,s  }, 
-  diag{£1> . . .  ,  K  =  diagfk^ . . .  ,kn>,  Ks  »  (1-a)^  +  aK.  Then  (h^KjJ 

is  a  regular  pair  if  and  only  if  for  every  K  6  S 

hTK  h  ||  h.  II2  -  h?K  hT  JlhlJ  2  *  o  (a)  (5.21) 

S  la  L  S  L 

where  h  is  a  maximum  eigenvalue  eigenvector  of  K. 

If  i  for  all  pairs  there  exists  a  positive  8  such  that  0  <  a  <  8 
implies  that  h  *  hL  for  all  K  €  S,  hence  (5.21)  is  true.  Nevertheless,  if 
max(-£^, . . . ,£n)  is  not  unique  (say  max(£^» . . . ,£n)  *  *  l 2)  there  is  no  h^ 

for  which  (5.21)  can  hold  for  every  K  €  S.  Of  course  (let  K  *  K^) ,  in  order 
for  (5.21)  to  be  true  it  is  necessary  that  h^  is  an  eigenvector  of  with 
maximum  eigenvalue,  i.e.  a(l,0,...,0)  and  b(0,l,0, . . . ,0) ,  but  if  k^  <  k^  and 
h^  *  a(l,0,...,0)  (analogously  if  k2  <  k^  and  *  b (0,1,0, . . . ,0)) 

hTK  h  II  h.  II2  -  hjx  h  llhll  2  -  llhll  2a2a(k,  -  l  )  =  0(a)  (5.22) 

S  la  L  S  L  c  1 

Directly  from  the  uncertainties  separation  theorem  we  have 

Theorem  5.5 

Suppose  the  uncertainties  in  the  signal  and  noise  autocorrelation 

operators  are  independent.  Then  (h.,(K  _,K  r))  is  a  saddle  point  of 

L  SL  tlL 

(■X,S*N,  R)  if  and  only  if 

hT  is  an  eigenvector  of  K  ,  with  maximum  eigenvalue, 

L  nL  sL 


1. 
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Note  that  (5.24)  is  the  same  as  condition  3  of  Theorem  3.2,  and  that  if  there 
are  minimal  and  maximal  elements  in  S  and  N  respectively,  they  are  the  solu¬ 
tion  of  (5.23)  and  (5.24). 


5.4  Hypothesis  Testing  and  Estimation  of  Location 

Our  main  goal  here  is  to  illustrate  the  application  of  the  results 
previously  derived  for  minimax  robust  filtering,  to  other  problems;  specif¬ 
ically  in  this  last  section  we  deal  with  the  cases  of  robust  hypothesis 
testing  [31]  and  robust  estimation  of  a  location  parameter  [30]  ,  regarded 
as  the  classical  starting  points  of  minimax  robust  procedures  in  statistics 
and  signal  processing. 

In  the  first  problem,  Huber  [31]  presents  the  robust  solution  for  a 
particular  uncertainty  set  of  probability  measures,  namely  a  mixture  class; 
however,  to  the  author's  knowledge,  no  result  for  general  uncertainty 
classes  is  available  yet.  The  application  of  the  robust  filtering  theorem, 
although  it  imposes  some  restrictions  on  the  possible  uncertainty  classes, 
in  a  step  forward  in  that  direction.  On  the  other  hand,  in  the  robust 
estimation  problem,  Huber  does  provide  a  general  theorem  (T.2  of  [30])  for 
convex  uncertainty  classes;  here  we  will  rederive  that  result  via  the 
robust  filtering  theorem.  (It  is  worth  noting  that  the  game-theoretic 
results  of  robust  filtering  (e.g.  [3], [27])  were  in  good  part  inspired  by 
Huber's  proof  of  (T.2  [30]).) 
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Suppose  the  problem  is  to  decide  with  minimum  probability  of  error 
(any  other  Bayes  test  can  be  treated  similarly)  between  two  probability 
measures  upon  observation  of  a  random  vector  X.  Hence  the  penalty  function 
is 


P 

e 


<*’P0’P1> 


n„ 


0  d  Pn  +  n. 


n 


(l-*)dP. 
Jn  J 


(5.25) 


where  the  test  0 (x)  €  $  is  the  conditional  probability  of  deciding  given 
that  X  is  observed.  The  test  that  minimizes  (5.25)  is  the  likelihood  ratio 
test : 


0W 


1 


tc 


O 


(5.26) 


with  arbitrary  k. 

Now  we  assume  that  the  pair  of  measures  (P^.P ^)  is  only  known  to 
belong  to  a  certain  set  With  respect  to  the  game  (£,$,Pe)  we  state 
and  prove  the  following. 


Theorem  5.6 


Suppose  that 

1.  The  uncertainty  set  of  probability  measures  Q  is  convex;  and 

2. 


for  every  pair  (Mq,M^)  e  Q,, 


(  rd*  n0  dP1L 

<  - 


dPU)  f,  fdf,lL  .  "<>  <  d,Mll 


dx 


(5.27) 
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where  I  denotes  the  indicator  function.  Then,  if  (Pq^,P^)  is  ^east 

favorable  for  (O.'J’.P  )  it  has  an  optimal  test  which  is  minimax  robust. 

6 


Proof 

Since  (5.25)  is  linear  in  Pq  and  P^  it  follows  that  it  is  concave  in 
(Pq,P^).  Note  that  it  is  also  concave  in  the  priors,  but  it  is  not  concave 
on  (Hq.PqjII^jP^) .  (Since  it  is  also  convex  in  $,  the  application  of  concave- 
convex  minimax  theorems  could  be  investigated.)  Therefore  all  we  have  to 
show  is  that  for  some  test  <t>^,  ((Pql,P1L^  a  re8u^ar  Pa*r*  To  this 

end  consider  (Pq»P^)  *  (1  -  at)  (PqL,P1l)  +  a^o*^l^  ^or  an  arbitrary  (Mq,M^) 

€  <2.,  then 


VVV  -  Pe(*L-P0-Pl> 


n 


(d-^H^-n^) 


(5.28) 


with  <t>  an  optimal  test  for  (P^.P^). 

The  region  of  integration  of  (5.28)  will  be  divided  into  9  regions: 


li  ■  { I: 
v2  -  {  x: 

S  *  {  X: 


dP  n„  dP 
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dPiL  n„  dP 
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ni  dPo  / 
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A6 
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A 
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dP 

dP 


1L 
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> 
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dPOL  "l 


dP 


1L 


dP 


1L 
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} 

} 

} 
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Thus,  using  a  slight  abuse  of  notation, 


P  (Pn,P,)  -  P  ($T,Pn,P,) 

e  U  1  e  L  0  1 


+ 

(1-0  + 

(k  -  kt  )  + 

K  +[(<-!)- 

’ 

A,  J 

A„  L 

A,  L 

A,  J  A, 

8  A9 


I'WW- 


Now  observe  that  we  can  write 


(5.29) 


"odPo  -  "idPi 


%  flL 

‘"I  '  dP0L 


dPlL  dPl 
dP0L  '  dP0)nidP° 


^0  dPlL  dPlL  ^1 

X  '  +  a  *  %  ‘VV  W* 


(5.30) 


Therefore,  taking  into  account  the  definition  of  the  sets  A^, 


o  [Pe(P0’Pl>  “  Pe<*L’P0’Pl)]  * 
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and 


a-KL)  - 
A2 


dPlL  ni 

^L]  (dPT  r r  ("o^o  -  W  = 

Ag  OL  0 


[  f(i-s) 

A 


*L  1  Vo  -  W  * 


(5.32) 


which  is  always  zero  with  an  appropriate  choice  of  k  ,  namely,  kt  *  0  or  1 

Li  la 

if  the  first  or  second  integral,  respectively,  of  (5.27)  is  nonzero.  If 
both  integrals  are  zero  the  value  of  tc^  is  irrelevant.  Note  that  we  have 
used  the  fact  —  see  eq.  (5.30)  —  that 


dP 


1L 


dP, 


OL 


1£ 

JP0  "l  dP0L 


Since  the  RN  derivatives  are  right  continuous  at  a  * 
reduce  to  null  sets  as  a  4-  0;  therefore  (5.31)  and  (5.32) 


0,  A^,A^,A^,Ag 
result  in 


llm„ ;  tpX'pi>  -  WW1  * 0  •  <5-33) 


But  for  every  a, 

Pe(P0’Pl>  <  ?e«l'W  ’  <5’34> 

therefore  the  regularity  of  ( (^ql’  P1L^  is  demonstrr,,:ec**  is  interesting 
to  underline  that  although  the  probability  of  deciding  one  of  the  hypotheses 
when  the  likelihood  ratio  is  equal  to  the  threshold  is  irrelevant  for  fixed 
probability  measures,  there  are  instances  in  which  this  is  no  longer  the  case 
for  the  robust  test,  as  we  have  just  seen. 
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In  Huber's  famous  formulation  of  the  problem  of  robust  estimation  of 
location  [30],  the  statistician  selects  an  influence  curve  €  'I'  in  order 
to  maximize  the  payoff 


ROM) 


(/* f’dt)2 
j>2fdt 


(5.35) 


where  f  is  the  marginal  probability  density  of  the  (independent)  errors  in 
the  location  measurements.  The  maximum  value  of  (5.35)  is  achieved  with 
t h  «  -f'/f  and  is  given  by  (via  the  Schwarz  inequality) 


•k 

K  (f)  - 


(5.36) 


If  the  density  function  f  lies  in  a  convex  set  F,  our  purpose  is  to 
find  the  robust  estimator  for  the  game  Ok.F.K).  Theorem  2  of  [30]  states  that 
the  robust  estimator  is  the  optimal  for  the  least  favorable  density  f^. 

A  proof  of  that  theorem  can  be  carried  out  by  showing  that  (^.f^)  is  a 
regular  pair  (Lemma  6  of  [30]  shows  that  K  is  convex  in  the  uncertainties). 

Let  f  =  (1  -  a) f L  +  ag  with  g  €  F;  then 


k  [K*(f)  -  K(*.,f)] 
a  L 


(j*Lfdt)2 

a/^2fdt 


afK  fdt 


f'(s)*L(t) 
f (8) 


[f'(s)f(t)^L(t) 


f  (s)f '  ( t)«^L (s)]  dtds 


(5.37) 


If  m(t,s)  -  (fL(s)g'(t)  +  g(s)f[(t))«L(s),  the  term  in  brackets  is 
a[m(t,s)  -  m(s,t)]  +  o (a) .  Hence,  when  a  r  0,  (5.37)  results  in 
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r 

f 
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lim  -  [K*(f)  -  K«l tf)]  = 
a  +  0  a  L 


^L(s)^L^t^m(t,s)  "  m(s,t)]dtds  *  0 


This  demonstrates  that  ,f  )  is  a  regular  pair,  and  thus  the  problem 
solved  by  Huber  fits  the  framework  developed  in  Chapter  2. 
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1 

1 
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6.  SUMMARY 


A  recapitulation  of  the  main  contributions  of  this  work  is  given  in 
this  chapter. 

First,  the  robust  filtering  theorem  presented  in  Chapter  2  gives  suf¬ 
ficient  conditions  on  the  performance  function  and  on  the  sets  of  filters 
and  operating  points  in  order  for  the  minimax  robust  filter  to  be  the  opti¬ 
mal  filter  for  the  least  favorable  operating  point.  In  this  theorem,  the 
central  issue  turns  out  to  be  the  introduction  of  the  concept  of  regular 
pairs  of  filter  and  operating  point.  By  means  of  this  result,  a  systematic 
procedure  is  developed  for  solving  problems  in  minimax  robust  filtering. 

So,  in  the  light  of  this  theorem,  several  previously  treated  problems  have 
been  revised  and  generalized,  and  other  new  problems  have  been  proposed  and 
solved.  A  technical  refinement  provided  by  the  robust  filtering  theorem 
has  been  the  introduction  of  an  equivalent  game  by  means  of  which  typical 
restrictions  of  the  uncertainty  classes  (pertaining  to  convexity  and 
existence  of  least  favorable  elements)  are  relaxed  in  certain  cases,  and 
that  allows  the  proof  of  extended  results  (class  theorems)  by  considering 
restricted  classes  of  filters.  The  next  general  result  that  we  presented, 
i.e.,  the  uncertainties  separation  theorem,  constitutes  a  useful  tool  when 
dealing  with  independent  (not  necessarily  convex)  uncertainty  classes. 

With  some  restrictions  on  the  performance  function,  this  theorem  results 
in  sets  of  equations  that  can  be  solved  for  the  robust  filter,  numerically 
or  analytically.  Next,  the  concept  of  soft  minimax  has  been  introduced; 
it  offers  the  possibility  of  incorporating  in  the  model  some  possible 
further  knowledge  about  the  uncertain  parameters  by  combining  hard  and 


soft  constraints. 
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Following  the  presentation  of  these  general  results  we  have  discussed 
their  application  to  several  specific  filtering  situations.  First  we  have 
given  a  complete  treatment  of  the  robust  matched  filtering  problem  following 
the  lines  of  the  general  work  by  Poor  [3];  it  is  shown  that  the  invertibility 
of  the  noise  covariance  operator  is  not  necessary  (but  it  is  sufficient)  in 
order  for  the  regularity  conditions  to  be  satisfied,  and  a  simple  set  of 
necessary  and  sufficient  conditions  for  a  pair  of  signal  and  noise  to  be 
least  favorable  is  derived  straightforwardly  from  the  uncertainties  separa¬ 
tion  theorem.  Specific  deviation  classes  are  studied  in  the  framework  of 
finite-length  discrete-time  processing  (although  the  majority  of  the  results 
presented  can  be  extended  to  the  more  general  Hilbert  space  setting).  Among 
other  conclusions  it  is  demonstrated  that,  when  the  deviations  of  the  signal 
and  of  the  noise  covariance  can  be  modeled  by  spheres  centered  on  given 
nominals,  the  robust  filter  is  the  one  matched  to  the  nominal  signal  and 
to  the  nominal  noise  with  an  additional  component  of  white  noise  (whose 
level  is  proportional  to  the  size  of  the  uncertainties);  hence,  in  the  fre¬ 
quent  event  in  which  the  nominal  noise  is  white  (or  in  which  the  nominal 
signal  is  designed  to  be  an  eigenvector  of  the  nominal  noise  covariance) 
the  nominal  matched  filter  is  minimax  robust.  Concerning  the  optimal 
design  of  nominal  signals  for  matched  filtering  in  the  presence  of  uncer¬ 
tainties,  two  extremal  cases  are  examined,  namely,  the  uncertainty 
model,  that  results  in  a  nonlinear  optimization  problem,  and  a  particular 
case  of  the  t  model,  whose  solution  is  the  minimum-eigenvalue  eigenvector 
of  the  noise  covariance;  i.e.,  the  optimal  signal  without  uncertainties. 

The  minimax  state  estimation  and  control  problem  is  addressed  next. 
General  uncertainty  classes  of  means  and  covariances  of  the  process  and 
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observation  white  noises  are  allowed,  and  the  linear  time-varying  transient 
case  is  analyzed.  Once  again,  the  general  results  of  Chapter  2  are  applied 
to  this  case,  leading  to  the  desired  necessary  and  sufficient  conditions 
for  least  favorability,  and  to  the  demonstration  of  the  minimax  robustness 
of  the  optimal  system  for  the  least  favorable  statistics.  In  each  of  the 
problems  that  we  consider,  distinct  conditions  for  least  favorability  are 
found;  in  particular,  in  relation  to  the  separation  theorem  of  stochastic 
control,  it  is  shown  that  while  the  controller  can  be  derived  separately, 
its  state  estimates  are  not  provided  by  the  same  Kalman  filter  as  for 
minimax  estimation.  In  order  to  deal  with  uncertain  means,  we  employ  the 
soft  minimax  approach  introduced  earlier;  sufficient  conditions  for  the 
nominal  system  to  be  (soft)  minlmax  robust  are  provided.  An  interesting 
aspect  is  the  fact  that  certain  previous  works  in  pursuit-evasion  games 
with  soft  energy  constraints  are  special  cases  of  the  problem  solved  here, 
and  as  such,  arrive  at  coincidental  solutions.  Several  examples  of  classes 
for  uncertain  noise  auto  and  crosscovariances  are  studied  in  detail.  Apart 
from  their  intrinsic  interest  they  serve  to  illustrate  the  parallelism  that 
we  have  encountered  in  dealing  with  analogous  uncertainty  classes  in  different 
filtering  situations.  This  can  be  found,  for  instance,  in  the  solution  to 
the  normed  deviation  model  and  in  the  transfer  of  uncertainties  from  one 
subclass  to  another  already  observed  in  the  matched  filtering  problem. 

Other  filtering  situations,  such  as  the  quadratic  receiver  and  the 
output  energy  filter,  in  which  the  minimax  robust  approach  has  not  been 
applied  before  are  also  Investigated.  In  these  cases,  we  point  out  several 
open  problems  for  which  the  general  results  are  not,  as  of  now,  applicable. 

For  instance,  no  generalized  performance  function  involving  the  variance 
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of  Che  output  under  the  signal-present  hypothesis  leads  to  a  solution  of 
the  robust  quadratic  receiver  problem.  Also,  finding  conditions  for 
regularity  in  the  output-energy  filtering  problem  —  a  generalization  of 
the  matched  filter  —  appears  as  a  difficult  and  interesting  question. 

The  well-known  problems  of  robust  location  estimation  and  of  robust  non- 
causal  Wiener  filtering  fit  the  general  framework  of  Chapter  2  and  alterna¬ 
tive  proofs  of  their  respective  minimax  robust  theorems  are  provided.  In 
the  case  of  Wiener  filtering,  tractable  and  more  meaningful  approaches  to 
deal  with  signal  uncertainty  classes  without  equal-power  constraints  should 
be  investigated.  Another  interesting  point  is  the  study  of  the  regularity 
condition  when  no  explicit  expressions  for  the  optimal  performance  achiev¬ 
able  at  every  operating  point  are  available  (e.g.  causal  filtering). 
Finally,  another  topic  that  deserves  further  attention  is  the  minimax 
hypothesis  testing  problem  for  nonspecific  uncertainty  classes;  although 
the  regularity  condition  (5.27)  required  by  our  results  does  not  seem  to 
impose  important  restrictions  in  practice,  its  further  relaxation  appears 
to  be  interesting. 


e- 
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